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Methods and Compositions for Treating Abnormal Cell Growth Related to 
Unwanted Guanine Nucleotide Exchange Factor Activity 
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FIELD OF THE INVENTION 

This invention is in the field of molecular biology, and involves methods and 
compositions for regulating unwanted cell growth through the regulation of the activity 
of certain guanine nucleotide exchange factors. 

15 BACKGROUND OF THE INVENTION 

Ras is a member of a superfamily of GTPases that regulate diverse signaling 
pathways. Ras itself has been shown to be involved in regulating cell growth and 
differentiation (See, Boguski, M. S. and McCormick, F. (1993) Nature 366. 

643-654). A subfamily of Ras consists of Rho. Rac. and Cdc42. These GTPase have 

20 also been shown to be involved in regulating cell growth, particularly as relating. to 

cellular transformation, as well as controlling the formation of focal contacts and 
alterations in the actin cytoskeleton which occur upon growth factor stimulation (See. 
Coso, O. A.. Chiariello. M., Yu, J.-C, Teramoto. H., Crespo. P.. Xu, N.. Miki, T. and 
Gutkind.J. S. (1995) G?// 81.1137-1146: Hill. C. S.. Wynne, J. and Treisman, R. 

25 (1995)G?// 81,1159-1170: Kozma, R., Ahmed, S.. Best, A. and Lim, L. (1995) Mol. 

Cell. Bioi 15, 1942-1952: Minden. A.. Lin. A., Claret, F.-X., Abo, A. and Karin. M. 
(1995) Cell 81,1147-1157: Nobes, C D. and Hall. A. (1995) Cell 81,53-62: Olson, 
M. F.. Ashworth. A. and Hall. A. (1995) Science 269. 1270-1272). Common to all 
Ras family members is their ability to cycle between inactive (GDP bound) and active 

30 (GTP bound) states. In this regard, these GTPases act as molecular switches, capable 

of processing information and then disseminating that information to control a specific 
pathway. 

This property of cycling between GTP and GDP states has provided a means to 
identify and purify proteins which regulate the nucleotide state of Ras and Ras related 
35 GTPases. See. Boguski, M. S. and McCormick. F. 0993) Nature 366, 643-654. 
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By monitoring the hydrolysis of GTP to GDP. GTPase activating proteins 
(GAPs) have been characterized for many members of the Ras family. See Boguski 
M. S. and McCormick. F. (1993) Nature 366. 643-654: Barfod. E. T.. Zheng Y 
Kuang. W,J.. Hart. M. J.. Evans. T.. Cerione. R. A. and Ashkenaz. A. (1993) J Biol 
Cnenu 268. 26059-26062: Lamarche. N. and Hal.. A. (.994) Trend, Genet. 10 436- 
440: Cerione. R. A. and Zheng. Y. (1996) Current Opinion in Cell Biology 8 216- 
222. The latter reference provides a good discussion of the properties of those proteins 
that atlect the guanine nucleotide state of Ras and Ras related proteins. Guanine 
nucleotide dissociation inhibitors (GDIs) were identified based on their ability to 
.ambit the dissociation of GDP. It has subsequently been determined that they also 
bind to the GTP state, inhibiting the intrinsic and GAP stimulated GTP hydrolysis 
See. Boguski. M. S. and McCormick. F. (1993) Nature 366. 643-654 In general 
GAPs and effectors have a high affinity for the GTP-bound state, while GDI proteins 
bind most tightly to the GDP-bound state. These properties have been exploited to 
purify effectors for Cdc42Hs ( See, Bagrodia. S.. Taylor. S. J.. Creasy C L 
Chemoff. J. and Cerione. R. A. (1995) / Biol. Che,n. 270. 22731-22737: Manser E 
Leung. T. Salihuddin. H.. Zhao. Z,s. and Lim. L. (Mature 367. 40-46: Martin" 
G. A.. Bollag. G.. McCormick. F. and Abo. A. (1995) EMBO J. 14. 1970-1978) Ras 
(See. Moodie. S. A.. Wi.lumsen. B. M.. Weber. M. J. and Wolfman. A. (1993) Science 
260. 1658-1661: Rodriguez-Viciana. P.. Warne. P. H.. Dhand. R.. Vanhaesebroeck. 

B.. Gout. I.. Fry. M. J.. Waterfield. M. D. and Downward. J. (1994) Nature 
370. 527-532) and Rho (See. Leung. T.. Manser. E.. Tan. L. and Lim. L. (.995) J 
Biol. Cnem. 270. 29051-29054: Watanabe. G.. Saito. Y.. Madaule. P.. Ishizaki T ' 
Fujisawa. K.. Morii. N.. Mukai. H.. Ono. Y.. Kakizuki. A. and Narumiya. S. (1996) 
Sctence 271. 645-648). An affinity approach has also been employed with Cdc42Hs- 
GTP and has led to the characterization of IQGAP1. a potential mediator for observed 
cytoskeletal events induced by Cdc42. See. Hart. M. J.. Callow. M.. Souza. B and 
Polakis. P. (1996) EMBO J. 15. 2997-3005. 

A modification of this affinity approach can a.so be used to identify and purify 
guanme nucleotide exchange factors (GEFs). GEFs can be distinguished from other 
regulatory proteins by their ability to interact preferentially with the nucleotide- 
depleted state of G-proteins. See. Hart. M. J.. Eva. A.. Zangrilli. D.. Aaronson. S A 
Evans. T. Cerione. R. A. and Zheng. Y. (.994) / Biol. Chen, 269. 62-65: Mosteller 
°- Han - 1 Br0eL D < 1994 > «oL Cell. Biol. 14. 1 104-11 12. By stimulate 
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the dissociation of GDP and subsequent binding of GTP. GEFs play an important role 
in the activation of Ras-like proteins. For example, Ras is converted to its GTP-bound 
form by the growth-factor stimulated translocation of Sos. a Ras-specific GEF. See, 
Buday. L. and Downward, J. (1993)CW/ 73. 61 1-620. 
5 The characterization of GEFs that specifically activate Rac family members 

will help elucidate signalling pathways in which these GTPases participate, and thus 
lead to a better understanding of the molecular basis of cell growth. This, in turn, will 
enable the identification of drugs for preventing or treating diseases where 
uncontrolled cell growth is the cause. Because Rac plays a key role in signal 

1 0 transduction and cell growth, the identification and properties of Rac GEFs is presently 

receiving considerable scientific attention. One such Rac GEF is known. Tiam- 1 . 
See, Michiels, F., Habets, G.G., Stam, J.C., van der Kammen, R.A., and Collard, 
J.G. (1995) Nature 375,338-340. See also. Eva. A. and Aaronson. S. A. (1985) 
Nature 316, 273-275: Toksoz. D. and Williams, D. A. (1994) Oncogene 9, 621-628. 

15 DESCRIPTION OF THE INVENTION 

The present invention relates to all aspects of a guanine exchange factor (GEF), 
in particular, a Rac-GEF. A GEF modulates cell signaling pathways, both in vitro and 
in vivo, by modulating the activity of a GTPase. By way of illustration, a Rac-GEF, 
which modulates the activity of a Rac GTPase. is described. However, the present 

20 invention relates to other GEFs. especially other Rac-GEFs. 

The present invention preferably relates to an isolated Rac-GEF polypeptide 
characterized by having a Src homology, Dbl homology and pleckstrin homology 
domains, and variants thereof, or fragments of such polypeptides, nucleic acids coding 
for such Rac-GEFs or nucleic acid fragments, and derivatives of the polypeptides and 

25 nucleic acids. 

The invention also relates to methods of using such polypeptides, nucleic acids, 
or derivatives thereof, e.g.. in therapeutics, diagnostics, and as research tools. 

Another aspect of the present invention involves antibodies and other ligands 
which recognize the invention Rac-GEF, regulators of Rac-GEF activity and other 

30 GEFs. and methods of treating pathological conditions associated or related to such 

Rac GTPase. 

The invention also relates to methods of testing for and/or identifying agents 
which regulate GEF by measuring their effect on GEF activity, e.g., in binding to a 
GTPase and/or nucleotide exchange activity. 
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The invention also relates to methods of assaying tor GEF activity, preferrably 
using activators of GEF activity. 

These and other aspects of the invention will become apparent upon a full 
considertion of the following disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows the complete nucleotide sequence (SEQ ID NO:l) and deduced 
amino acid sequence (SEQ ID NO:2) for a p ol y P eptide encoded for by a human GEF- 
Rac gene. 

FIG. 2 shows the brain specific nucleotide sequence for a Rac-GEF. 
FIG. 3 shows the domain structures of full length Tiam-1. and truncations of 
the molecule. 

FIG. 4 shows the stimulatory effect of ascorbyl stearate on Rac exchange 
activity by various forms of truncated Tiam- . . the 85kd and 1 35 kD molecules. 

FIG. 5 shows the effects of certain ascorbyl compounds, inositol lipids and 
phospholipids on Tiam-1 stimulated Rac-GEF activity. 

FIG. 6 shows the effects of ascorbyl sterate on Tiam- 1 constructs that have PH 
and DH domains. 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, a novel polypeptide and 
nucleic acid coding for a Rac-GEF has been identified and isolated. Alternate 
variants of the molecule have also beeen identified. As used herein, Rac-GEF 
means a polypeptide, or a nucleic acid coding for a Rac-GEF polypeptide 
which polypeptide has a specific binding affinity for a guanine nucleotide- 
depleted state of G-proteins (in particular Rac), a guanine nucleotide exchange 
activity, an oncogenic transforming activity, and an immunogenic activity. By 
specific binding affinity, it is meant that the polypeptide has a binding 
preference for the nucleotide-depleted state of the G-protein, in contrast, e.g., 
to the GDP- or GTP-bound state of the G-protein which is preferentially bound 
by other regulatory proteins. By guanine nucleotide exchange activity, it is 
meant that the polypeptide stimulates or catalyzes the dissociation of GDP 
from a G-protein, such as Rac, and subsequent binding of GTP. By cellular 
oncogenic transforming activity, it is meant that introduction of a nucleic acid 
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coding for Rac-GEF into a cell line, e.g., NIH 3T3 cells, confers a transformed 
phenotype on such cells. A transformed phenotype can be measured by foci 
formation, e.g., as characterized and described by Eva and Aaronson, Nature, 
316:273-276, 1985. Immunogenic activity means that the polypeptide binds to' 
5 Rac-GEF specific antibodies 6r is capable of eliciting an immune response 

specific for a Rac-GEF. Immunogenic activities are discussed below. The 
above-mentioned activities of a Rac-GEF polypeptide can be assayed, e.g., as 
described below in the examples or according to methods which the skilled 
worker would know. A Rac-GEF polypeptide, or corresponding nucleic acid 

10 coding for it, means a polypeptide which can be isolated from a natural 

source. It therefore includes naturally-occurring normal and mutant alleles. 
Natural sources include, e.g., living cells obtained from tissues and whole 
organisms, and cultured cell lines. 

To identify a human gene that encodes a Rac-GEF, we performed a 

15 search of the EST data base for Dlb homologs. The search was performed 

using an amino acid sequence (residues 1-519) encoded by the human TIM 
protein (Chan et al., 1994, Ocogene, Vol. 9, pages 1057-1063). A single clone 
was identified, and the plasmid encoding this insert was purchased via the 
I.M.A.G.E. Consortium (Research Genetics). Using this cDNA as template, a 

20 511-bp "P-labelled PCR product was produced using oligos 

5'-GGAGGCCATGTTCGAGCTGG-3' and 

5'- GCTGATC ATCTGTTCCGTGC-3' (5' and 3' primers, respectively) and K P 
labelled nucleotides. This labeled PCR fragment was used as a probe to screen 
approximately 4 x 10 s clones of a human fetal brain Lambda ZAP cDNA 

25 library (Stratagene). A clone with an insert of 2.6-kb was isolated, and the 

complete DNA sequence of this clone was determined and shown to have a 
single open reading frame of 1950-bp that is predicted to encode a 650-amino 
acids protein with a calculated molecular mass of 74.7 kDa. A comparision of 
the DNA sequences of the EST insert to the fetal brain cDNA revealed a 72 

30 v base pair insert in the fetal brain sequence. The insert is in the DH domain. 
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As discussed more in the Examples, Northern analysis revealed a 3.5 kb 
transcript in brain tissue and a 4 kb transcript in liver. Consequently, using 
the additional sequence identified from the 2.6-kb sequence that was not 
present in EST #167059 we identified another EST (#109922) that had been 
isolated from a human cDNA liver library. The plasmid containing this insert 
was also obtained, and the insert sequenced which revealed an initiating 
methionine. 

Fig. 1 (SEQ ID NO(s): 1 and 2) show the alignment of the full length 
liver nucleotide cDNA sequence, with its deduced amino acid sequence, 
respectively. It is note worthy that this sequence has an additional 126 amino 
acids which differ from the amino-terminal 66 amino acids of the 2.6 kb brain 
cloned (Fig. 2). Also shown in the figure are various domains, including the 
Src homology 3, Dbl homology and pleckstrin homology domains. It, or its 
corresponding gene, can be isolated from natural sources. Characterization of 
a human Rac-GEF is described below and in the examples. 

It is noteworthy that because of the protein-protein interactive 
properties of the Src homology 3 domain, ligands that bind to this domain 
may be identified, for example, by screening an expression library, that affect 
Rac-GEF activity. Such ligands would have medical applications. 

The present invention also relates to polypeptide fragments of Rac- 
GEF. The fragments are preferably biologically-active. By biologically-active, 
it is meant that the polypeptide fragment possesses an activity in a living 
system or with components of a living system. Biological-activities include: a 
specific binding affinity for a guanine nucleotide-depleted state of G-proteins, 
in particular Rac, a guanine nucleotide exchange activity, an oncogenic 
transforming activity, an immunogenic activity, modulating the binding 
between a Rac-GEF and a Rac GTPase, or acting as an agonist or antagonist of 
Rac GTPase activity. Such activities can be assayed routinely, e.g., according to 
the methods described above and below. Various fragments can be prepared. 
See the examples below for further discussion. Fragments can also be selected 
in which one or more of the mentioned activities are eliminated or altered 



6 



WO 98/57990 



PCT/US98/12391 



when compared to Rac-GEF. As described in the examples, such fragments 
can be prepared routinely, e.g., by recombinant means or by proteolytic 
cleavage of isolated polypeptides, and then assayed for a desired activity. 

The present invention also relates to a human Rac-GEF specific amino 
5 acid sequence as set forth in Fig. 1 (SEQ ID NO: 2): A clone encoding such 

sequence, 128 to 711 amino acids and also containing 66 divergent amino 
acids as shown in Figure 2, has been deposited on December 11, 1996 with the 
American Type Culture Collection with Accession No. 98273. A Rac-GEF 
specific amino acid sequence means a defined amino acid sequence. A specific 

10 amino acid sequence can be found routinely, e.g., by searching a gene/protein 

database using the BLAST set of computer programs. A Rac-GEF specific 
amino acid sequence can be useful to produce peptides as antigens to generate 
an immune response specific for Rac-GEF. Antibodies obtained by such 
immunization can be used as a specific probe for the Rac-GEF protein for 

15 diagnostic or research purposes. Such peptides can also be used to inhibit the 

Rac-GEF binding to Rac to modulate pathological conditions in cells. 

A polypeptide of the invention, e.g., having a polypeptide sequence as 
shown in Fig. 1 (SEQ ID NO: 2), can by analyzed by available methods to 
identify structural and/ or functional domains in the polypeptide. For 

20 example, when the polypeptide coding sequence set forth in Fig. 1 (SEQ ID 

NO:2) is analyzed by computer algorithms, a continuous coding sequence 
comprising the following domains is identified: Src homology, Dbl homology 
and pleckstrin homology domains. Various programs can be employed to 
analyze structure of the polypeptide, including, EMBL Protein Predict; Rost 

25 and Sander, Proteins, 19:55-72, 1994; Kyte and Doolittle, J. Mol. Bio.: 157:105, 

1982. 

A polypeptide of the present invention can also have 100% or less 
amino acid sequence identity to the amino acid sequence set forth in Fig 1. 
(SEQ ID NO: 2). For the purposes of the following discussion: Sequence 
30 identity means that the same nucleotide or amino acid which is found in the 

sequence set forth in Fig 1. (SEQ ID NO: 1 and SEQ ID NO: 2) is found at the 
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corresponding position of the compared sequence(s). A polvpeptide having 
less than 100% sequence identify to the amino acid sequence set forth in Fig 1 
(SEQ. ID NO: 2) can be substituted in various ways, e.g., bv a conservative 
ammo acid. See below for examples of conservative amino acid substitution 
The sum of the identical and conserved residues divided by the total number 
of residues in the sequence over which the Rac-GEF polypeptide is compared 
» equal to the percent sequence similarity. For purposes of calculating 
sequence identity and similarity, the compared sequences can be aligned and 
calculated according to any desired method, algorithm, computer program, 
etc., including, e.g., FASTA, BLASTA. A polypeptide having less than 100% 
ammo acid sequence identity to the amino acid sequence of Fig. 1 (SEQ ID 
NO: 2) can comprise e.g., about 60, 65. more preferably, 67, 70, 78, 80, 90 92 
96, 99, etc. 

A Rac GEF polypeptide, fragment, or substituted GEF polypeptide can 
also comprise various modifications, where such modifications include 
glycosylate, covalent modifications (e.g., of an R- gr0 up of an amino acid), 
ammo acid substitution, amino acid deletion, or amino acid addition 
Modulations to the polypeptide can be accomplished according to various 
methods, including recombinant, synthetic, chemical, etc. 

A mutation to a Rac-GEF polypeptide can be selected to have a 
biological activity of Rac-GEF, e.g., a specific binding affinity for a guanine 
nucleotide-depleted state of G-proteins, in particular Rac, a guanine nucleotide 
exchange activity, an oncogenic transforming activity, and an immunogenic 

activity. The selection and preparation of mutations of Rac-GEF is discussed 
below. 

Polypeptides of the present invention (e.g., Rac-GEF, fragments 
thereto, mutations thereof) can be used in various ways, e.g., as immunogens 
for antibodies as described below, as biologically-active agents (e.g., having 
one or more of the activities associated with Rac-GEF), as inhibitors of Rac- 
GEF. For example, upon binding of Rac-GEF to Rac, a cascade of events is 
initiated in the cell, e.g., promoting cell proliferation and/or cytoskeletal 



98/57990 



PCT/US98/12391 



rearrangements. The interaction between RaoGEF and Rac can be modulated 
by using a peptide fragment of Rac-GEF, e.g., a peptide fragment which is an 
inhibitor at the site where Rac-GEF interacts (e.g., binds) to Rac. Such a 
fragment can be useful for modulating pathological conditions associated with 
the Rac signaling pathway. A useful fragment can be identified routinely by 
testing the ability of overlapping fragments of the entire length of Rac-GEF to 
inhibit a Rac-GEF activity, such as guanine nucleotide exchange activity, 
binding to a guanine nucleotide depleted state of Rac. and oncogenic 
transforming activity. The measurement of certain of these activities is described 
below, and in the examples. These peptides can also be identified and 
prepared as described in EP 496 162. Peptides can be chemically-modified, 
etc. 

A polypeptide coding for a Rac-GEF polypeptide, or a derivative or 
fragment thereof, can be combined with one or more structural domains, 
functional domains, detectable domains, antigenic domains, and/or a desired 
polypeptides of interest, in an arrangement which does not occur in nature, 
i.e., not naturally-occurring, e.g., as in a normal Rac-GEF gene, a genomic 
fragment prepared from the genome of a living organism, e.g., an animal, 
preferably a mammal, such as human, mouse, or cell lines thereof. A 
polypeptide comprising such features is a chimeric or fusion polypeptide. 
Such a chimeric polypeptide can be prepared according to various methods, 
including, chemical, synthetic, quasi-synthetic, and /or recombinant methods. 
A chimeric nucleic acid coding for a chimeric polypeptide can contain the 
various domains or desired polypeptides in a continuous or interrupted open 
reading frame, e.g., containing introns, splice sites, enhancers, etc. The - 
chimeric nucleic acid can be produced according to various methods. See, e.g., 
U.S. Pat. No. 5,439,819. A domain or desired polypeptide can possess any 
desired property, including, a biological function such as catalytic, signalling, 
growth promoting, cellular targeting, etc., a structural function such as 
hydrophobic, hydrophilic, membrane-spanning, etc., receptor-ligand 
functions, and/or detectable functions, e.g., combined with enzyme, 
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fluorescent polypeptide, green fluorescent protein GFP (Chalfie et al 1994 
Science, 263:802; Cheng etal., 1996, Nature Biotechnology, 14:606; Levy et al 
1996, Nature Biotechnology, 14:610, etc. In addition, a Rac-GEF nucleic acid, or 
a part of it, can be used as selectable marker when introduced into a host cell 
For example, a nucleic acid coding for an amino acid sequence according to 
the present invention can be fused in-frame to a desired coding sequence and 
act as a tag for purification, selection, or marking purposes. The region of 
fusion encodes a cleavage site. 

A polypeptide according to the present invention can be produced in an 
expression system, e.g., in vivo, in vitro, cell-free, recombinant, cell fusion, 
etc., according to the present invention. Modifications to the polypeptide 
imparted by such system include, glycosylate, amino acid substitution (e g 
by during codon usage), polypeptide processing such as digestion, cleavage, 
endopeptidase or exopeptidase activity, attachment of chemical moieties 
including lipids, phosphates, etc. For example, some cell lines can remove the 
terminal methionine from an expressed polypeptide. 

A polypeptide according to the present invention can be recovered from 
natural sources, transformed host cells (culture medium or cells) according to 
the usual methods, including, ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, hydroxyapatite 
chromatography and lectin chromatography. It may be useful to have low 
concentrations (approximately 0.1-5 mM) of calcium ion present during 
purification (Price, et al., /. Biol Chenu, 244:917 (1969)). Protein refolding steps 
can be used, as necessary, in completing the configuration of the mature 
protein. Finally, high performance liquid chromatography (HPLC) can be 
employed for final purification steps. 

In accordance with the present invention, a nucleic acid coding for a 
Rac-GEF can comprise, e.g., the complete coding sequence as set forth in Fig. 1 
(SEQ ID NO: 1). A nucleic acid according to the present invention can also 



10 



WO 98/57990 



PCT/US98/12391 



comprise a nucleotide sequence which is 100% complementary, e.g., an anti- 
sense, to any nucleotide sequence mentioned above and below. 

A Rac GEF encoding nucleic acid according to the present invention can 
be obtained from a variety of different sources. It can be obtained from DNA 

5 or RNA, such as polyadenylated mRNA, e.g., isolated from tissues, cells, or 

whole organism. The nucleic acid can be obtained directly from DNA or 
RNA, or from a cDNA library. The nucleic acid can be obtained from a cell at 
a particular stage of development, having a desired genotype, phenotype (e.g., 
an oncogenically transformed cell or a cancerous cell), etc. 

() A nucleic acid comprising a nucleotide sequence coding for a 

polypeptide according to the present invention can include only coding 
sequence of Rac-GEF; coding sequence of Rac-GEF and additional coding 
sequence (e.g., sequences coding for leader, secretory, targeting, enzymatic, 
fluorescent or other diagnostic peptides), coding sequence of Rac-GEF and 

5 non-coding sequences, e.g., untranslated sequences at either a 5* or 3' end, or 

dispersed in the coding sequence, e.g., introns. A nucleic acid comprising a 
nucleotide sequence coding without interruption for a Rac-GEF polypeptide 
means that the nucleotide sequence contains an amino acid coding sequence 
for a Rac-GEF polypeptide, with no non-coding nucleotides interrupting or 

0 intervening in the coding sequence, e.g., absent intron(s). Such a nucleotide 

sequence can also be described as contiguous. 

A nucleic acid according to the present invention also can comprise an 
expression control sequence operably linked to a nucleic acid as described 
above. The phrase "expression control sequence" means a nucleic acid 

5 sequence which regulates expression of a polypeptide coded for by a nucleic 

acid to which it is operably linked. Expression can be regulated at the level of 
the mRNA or polypeptide. Thus, the expression control sequence includes 
mRNA-related elements and protein-related elements. Such elements include 
promoters, enhancers (viral or cellular), ribosome binding sequences, 

0 transcriptional terminators, etc An expression control sequence is operably 

linked to a nucleotide coding sequence when the expression control sequence 
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is positioned in such a manner to effect or achieve expression of the coding 
sequence. For example, when a promoter is operably linked 5' to a coding 
sequence, expression of the coding sequence is driven by the promoter. 
Expression control sequences can be heterologous or endogenous to the 
normal gene. 

A nucleic acid in accordance with the present invention can be selected 
on the basis of nucleic acid hybridization. The ability of two single-stranded 
nucleic acid preparations to hybridize together is a measure of their nucleotide 
sequence complementarity, e.g., base-pairing between nucleotides, such as A- 
T, G-C, etc. The invention thus also relates to nucleic acids which hybridize to 
a nucleic acid comprising a nucleotide sequence as set forth in Fig. 1 (SEQ ID 
NO: 1). A nucleotide sequence hybridizing to the latter sequence will have a 
complementary nucleic acid strand, or act as a template for one in the presence 
of a polymerase (i.e., an appropriate nucleic acid synthesizing enzvme). The 
present invention includes both strands of nucleic acid, e.g., a sense strand and 
an anti-sense strand. 

Hybridization conditions can be chosen to select nucleic acids which 
have a desired amount of nucleotide complementarity with the nucleotide 
sequence set forth in Fig. 1 (SEQ ID NO: 1). A nucleic acid capable of 
hybridizing to such sequence, preferably, possesses 50%, more preferably, 70% 
complementarity, between the sequences. The present invention particularly 
relates to DNA sequences which hybridize to the nucleotide sequence set forth 
in Fig. 1 (SEQ ID NO: 1) under stringent conditions. As used here, "stringent 
conditions" means any conditions in which hybridization will occur where 
there is at least about 95%, preferably 97%, nucleotide complementarity 
between the nucleic acids. Such conditions include, e.g., hybridization for 
Northern: 5X SSPE, 10X Denhardts solution, 100 ug/ml freshly denatured and 
sheared salmon sperm DNA, 50% formamide, 2% SDS at 42°C; hybridization 
for cloning from cDNA library: IX PAM, 0.1% SDS, 50% formamide at 42°C 

According to the present invention, a nucleic acid or polypeptide can 
comprise one or more differences in the nucleotide or amino acid sequence set 
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forth in Fig: 1 (SEQ ID NO: 1 and SEQ ID NO: 2). Changes or modifications to 
the nucleotide and/or amino acid sequence can be accomplished by any 
method available, including directed or random mutagenesis. 

A nucleic acid coding for a Rac-GEF according to the invention can 
comprise nucleotides which occur in a naturally-occurring Rac-GEF gene e.g., 
naturally-occurring polymorphisms, normal or mutant alleles (nucleotide or 
amino acid), mutations which are discovered in a natural population of 
mammals, such as humans, monkeys, pigs, mice, rats, or rabbits. By the term 
naturally-occurring, it is meant that the nucleic acid is obtained from a natural 
source, e.g., animal tissue and cells, body fluids, tissue culture cells, forensic 
samples. Naturally-occurring mutations to Rac-GEF can include deletions 
(e.g., a truncated amino- or carboxy-terminus), substitutions, or additions of 
nucleotide sequence. These genes can be detected and isolated by nucleic acid 
hybridization according to methods which one skilled in the art would know. 
It is recognized that, in analogy to other oncogenes, naturally-occurring 
variants of Rac-GEF include deletions, substitutions, and additions which 
produce pathological conditions in the host cell and organism. 

A nucleotide sequence coding for a Rac-GEF polypeptide of the 
invention can contain codons found in a naturally-occurring gene, transcript, 
or cDNA, for example, e.g., as set forth in Fig. 1 (SEQ ID NO: 1), or it can 
contain degenerate codons coding for the same amino acid sequences. 

In addition, a nucleic acid or polypeptide of the present invention can 
be obtained from any desired mammalian organism, but also non-mammalian 
organisms. Homologs from mammalian and non-mammalian organisms can 
be obtained according to various methods. For example, hybridization with 
an appropriate oligonucleotide selective for Rac-GEF can be employed to 
select such homologs, e.g., as described in Sambrook et al., Molecular Cloning, 
1989, Chapter 11. 

Such homologs can have varying amounts of nucleotide and amino acid 
sequence identity and similarity to Rac-GEF. Non-mammaiian organisms 
include, e.g., vertebrates, invertebrates, chicken, Drosophila, yeasts (such as 
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Saccharomyces cerevisiae), C. elegans, roundworms, prokaryotes, plants, 
Arabidopsis, viruses, etc. 

Modifications to a Rac-GEF sequence, e.g., mutations, can also be 
prepared based on homology searching from gene data banks, e.g., Genbank, 
EMBL. Sequence homology searching can be accomplished using various 
methods, including algorithms described in the BLAST family of computer 
programs, the Smith-Waterman algorithm, etc. For example, conserved amino 
acids can be identified between various sequences, Dbl, Ibc, Ost, lsc, CDC24, 
etc. See, e.g., Touhara et al., J. Biol. Chem., 269:10217-10220, 1994; Toksoz and 
Williams, Oncogene, 9:621-628, 1994; Whitehead et al., J. Biol. Chem., 
271:18643-18650, 1996. A mutation(s) can then be introduced into a Rac-GEF 
sequence by identifying and aligning amino acids conserved between the 
polypeptides and then modifying an amino acid in a conserved or non- 
conserved position. A mutated Rac-GEF gene can comprise conserved or 
nonconserved amino acids, e.g., between corresponding regions of 
homologous nucleic acids, especially between Dbl homology (DH) domains, 
etc. For example, a mutated sequence can comprise conserved or non- 
conserved residues from any number of homologous sequences as mentioned- 
above and/or determined from an appropriate searching algorithm. 

Mutations can be made in specific regions of nucleic acid coding for the 
Rac-GEF polypeptide, e.g., in the Dbl homology domain, such as replacing it, 
changing amino acid sequences within it, etc., to analyze a function (e.g., 
oncogenic transformation, binding to a G-protein, guanine nucleotide 
exchange) of the polypeptide coded for by the nucleic acid. For example, 
deletion of the pleckstrin domain would result in the loss of oncogenic 
transforming activity. The pleckstrin domain can also be involved with lipid 
(e.g., phosphoinositides) binding, binding to Rac, activation of the guanine 
nucleotide exchange activity, and localization of the polypeptide in the cell. 
Thus, this region can be mutagenized according to various methods and then 
assayed for loss or gain of the mentioned functions. The DH domain is 
involved with promoting GDP dissociation from the Rac GTPase. Thus, 



14 



WO 98/57990 



PCT/US98/12391 



substitutions or deletions within this region can be prepared and assayed 
routinely for loss or gain of function. A mutation can be made in these or 
other regions of Rac-GEF which affect its phosphorylation or protein/lipid 
interaction leading to its modulation of the growth signaling pathway. Such a 
5 mutated gene can be useful in various ways; for diagnosis in patients having 

such a mutation, to introduce into cells or animals (transgenic) as a model for a 
pathological condition. Mutations which affect both GEF activity and 
transforming activity can be analogous to those made in the DH domain of the 
Dbl oncogene as described in Hart et al, T. Biol. Chem.. 269:62-65. 

10 A nucleic acid and corresponding polypeptide of the present invention 

include sequences which differ from the nucleotide sequence of Fig. 1 (SEQ ID 
NO: 1) but which are phenotypically silent. These sequence modifications 
include, e.g., nucleotide substitution which do not affect the amino acid 
sequence (e.g., different codons for the same amino acid), replacing naturally- 

15 occurring amino acids with homologous or conservative amino acids, e.g., 

(based on the size of the side chain and degree of polarization) small nonpolar: 
cysteine, proline, alanine, threonine; small polar: serine, glycine, aspartate, 
asparagine; large polar: glutamate, glutamine, lysine, arginine; intermediate 
polarity: tyrosine, histidine, tryptophan; large nonpolar: phenylalanine, 

20 methionine, leucine, isoleucine, valine. Such conservative substitutions also 

include those described by Dayhoff in the Atlas of Protein Sequence and 
Structure 5 (1978), and by Argos in EMBOT.. 8, 779-785 (1989). 

A nucleic acid can comprise a nucleotide sequence coding for a 
polypeptide having an amino acid sequence as set forth in Fig 1. (SEQ ID NO: 

25 2 ) except where one or more positions are substituted by conservative amino 

acids; or a nucleotide sequence coding for a polypeptide having an amino acid 
sequence as set forth in Fig 1. (SEQ ID NO:2). The invention also relates to 
polypeptides coded for by such nucleic acids. In addition, it may be desired to 
change the codons in the sequence to optimize the sequence for expression in a 

30 desired host. 
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A nucleic acid according to the present invention can comprise, e.g., 
DNA, RNA, synthetic nucleic acid, peptide nucleic acid, modified nucleotides, 
or mixtures. A DNA can be double- or single-stranded. Nucleotides 
comprising a nucleic acid can be joined via various known linkages, e.g., ester, 
sulfamate, sulfamide, phosphorothioate, phosphoramidate, 
methylphosphonate, carbamate, etc., depending on the desired purpose, e.g., 
resistance to nucleases, such as RNase H, improved in vivo stability, etc. See, 
e.g., U.S. Pat. Nos. 5,378,825. 

Various modifications can be made to the nucleic acids, such as 
attaching detectable markers (avidin, biotin, radioactive elements), moieties 
which improve hybridization, detection, or stability. The nucleic acids can 
also be attached to solid supports, e.g., nitrocellulose, nyion, agarose, 
diazotized cellulose, latex solid microspheres, polyacrylamides, etc., according ■ 
to a desired method. See, e.g., U.S. Pat. Nos. 5,470,967, 5,476,925, 5,478,893. 

Another aspect of the present invention relates to oligonucleotides and 
nucleic acid probes. Such oligonucleotides or nucleic acid probes can be used, 
e.g. to detect, quantitate, or isolate a Rac-GEF nucleic acid in a test sample 
Detection can be desirable for a variety of different purposes, including 
research, diagnostic, and forensic. For diagnostic purposes, it mav be 
desirable to identify the presence or quantity of a Rac-GEF nucleic acid 
sequence in a sample, where the sample is obtained from tissue, cells, body 
fluids, etc. In a preferred method, the present invention relates to a method of 
detecting a Rac-GEF nucleic acid comprising, contacting a target nucleic acid 
in a test sample with an oligonucleotide under conditions effective to achieve 
hybridization between the target and oligonucleotide; and detecting 
hybridization. An oligonucleotide in accordance with the invention can also 
be used in synthetic nucleic acid amplification such as PGR, e.g., Saiki et al., 
1988, Science, 241:53; U.S. Pat. No. 4,683,202. 

Another aspect of the present invention is a nucleotide sequence which 
is unique to Rac-GEF. By a unique sequence to Rac-GEF, it is meant a 
defined order of nucleotides which occurs in Rac-GEF, e.g., in the nucleotide 
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sequence of Fig. 1 (SEQ ID NO: 1), but rarely or infrequently in other nucleic 
acids, especially not in an animal nucleic acid, preferably mammal, such as 
human, rat, mouse, etc. Both sense and antisense nucleotide sequences are 
included. A unique nucleic acid according to the present invention can be 
determined routinely. A nucleic acid comprising a unique sequence of Rac- 
GEF can be used as a hybridization probe to identify the presence of Rac-GEF 
in a sample comprising a mixture of nucleic acids, e.g., on a Northern blot. 
Hybridization can be performed under stringent conditions to select nucleic 
acids having at least 95% identity (i.e., complementarity) to the probe, but less 
stringent conditions can also be used. A unique Rac-GEF nucleotide sequence 
can also be fused in-frame, at either its 5' or 3' end, to various nucleotide 
sequences as mentioned throughout the patent, including coding sequences for 
other parts of Rac-GEF, enzymes, GFP, etc, expression control sequences, etc. 

Hybridization can be performed under different conditions, depending 
on the desired selectivity, e.g., as described in Sambrook et al., Molecular 
Cloning, 1989. For example, to specifically detect Rac-GEF, an oligonucleotide 
can be hybridized to a target nucleic acid under conditions in which the 
oligonucleotide only hybridizes to Rac-GEF, e.g., where the oligonucleotide is 
100% complementary to the target. Different conditions can be used if it is 
desired to select target nucleic acids which have less than 100% nucleotide 
complementarity, at least about, e.g., 99%, 97%, 95%, 90%, 70%, 67%. Since a 
mutation in a Rac-GEF gene can cause diseases or pathological conditions, 
e.g., cancer, benign tumors, an oligonucleotide according to the present 
invention can be used diagnostically. For example, a patient having 
symptoms of a cancer or other condition associated with the Rac signaling 
pathway (see below) can be diagnosed with the disease by using an 
oligonucleotide according to the present invention, in polymerase chain 
reaction followed by DNA sequencing to identify whether the sequence is 
normal, in combination with other oncogene oligonucleotides, etc., e.g., p53, 
Rb, p21, Dbl, MTS1, Wtl, Bcl-1, Bcl-2, MDM2, etc. In a preferred method, the 
present invention relates to a method of diagnosing a cancer comprising 
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contacting a sample comprising a target nucleic acid with an oligonucleotide 
under conditions effective to permit hybridization between the target and 
oligonucleotide; detecting hybridization, wherein the oligonucleotide 
comprises a sequence of Rac-GEF, preferably a unique sequence of Rac-GEF; 
and determining the nucleotide sequence of the target nucleic acid to which 
the oligonucleotide is hybridized. The sequence can be determined according 
to various methods, including isolating the target nucleic acid, or a cDNA 
thereof, and determining its sequence according to a desired method. 

Oligonucleotides according to the present invention can be of any 
desired size, preferably 14-16 oligonucleotides in length, or more. Such 
oligonucleotides can have non-naturally-occurring nucleotides, e.g., inosine. 
In accordance with the present invention, the oligonucleotide can comprise a 
kit, where the kit includes a desired buffer (e.g., phosphate, tris, etc.), 
detection compositions, etc. The oligonucleotide can be labeled or unlabeled, 
with radioactive or non-radioactive labels as known in the art. 

Anti-sense nucleic acid can also be prepared from a nucleic acid 
according to the present, preferably an anti-sense to a coding sequence of Fig. 
1 (SEQ ID NO: 1). Antisense nucleic acid can be used in various ways, such as 
to regulate or modulate expression of Rac-GEF, e.g., inhibit it, to detect its 
expression, or for in situ hybridization. For the purposes of regulating or 
modulating expression of Rac-GEF, an anti-sense oligonucleotide can be 
operably linked to an expression control sequence. 

The nucleic acid according to the present invention can be labelled 
according to any desired method. The nucleic acid can be labeled using 
radioactive tracers such as 32 P , 35 S/ 125^ 3 H , or 14 Q t0 mention onJy ^ most 
commonly used tracers. The radioactive labelling can be carried out according 
to any method such as, for example, terminal labeling at the 3' or 5' end using 
a radiolabeled nucleotide, polynucleotide kinase (with or without 
dephosphorylation with a phosphatase) or a ligase (depending on the end to 
be labelled). A non-radioactive labeling can also be used, combining a nucleic 
acid of the present invention with residues having immunological properties 
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(antigens, haptens), a specific affinity for certain reagents (ligands), properties 
enabling detectable enzyme reactions to be completed (enzymes or coenzymes, 
enzyme substrates, or other substances involved in an enzymatic reaction), or 
characteristic physical properties, such as fluorescence or the emission or 
5 absorption of light at a desired wavelength, etc. 

A nucleic acid according to the present invention, including 
oligonucleotides, anti-sense nucleic acid, etc., can be used to detect expression 
of Rac-GEF in whole organs, tissues, cells, etc., by various techniques, 
including Northern blot, PCR, in situ hybridization, etc. Such nucleic acids 
can be particularly useful to detect disturbed expression, e.g., cell-specific 
and/or subcellular alterations, of Rac-GEF. The levels of Rac-GEF can be 
determined alone or in combination with other genes products (oncogenes 
such as p53, Rb, Wtl, etc.), transcripts, etc. 

A nucleic acid according to the present invention can be expressed in a 
15 variety of different systems, in vitro and in vivo, according to the desired 

purpose. For example, a nucleic acid can be inserted into an expression vector, 
introduced into a desired host, and cultured under conditions effective to 
achieve expression of a polypeptide coded for the nucleic acid. Effective 
conditions includes any culture conditions which are suitable for achieving 
20 production of the polypeptide by the host cell, including effective 

temperatures, pH, medias, additives to the media in which the host cell is 
cultured (e.g., additives which amplify or induce expression such as butyrate, 
or methotrexate if the coding nucleic acid is adjacent to a dhfr gene), 
cyclohexamide, cell densities, culture dishes, etc. A nucleic acid can be 
25 introduced into the cell by any effective method including, e.g., calcium 

phosphate precipitation, electroporation, injection, DEAE-Dextran mediated 
transfection, fusion with liposomes, and viral transfection. A cell into which a 
nucleic acid of the present invention has been introduced is a transformed host 
cell. The nucleic acid can be extrachromosomal or integrated into a 
chromosome(s) of the host cell. It can be stable or transient. An expression 
vector is selected for its compatibility with the host cell. Host cells include, 
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mammalian cells, e.g., COS-7, CHO, HeLa, LTK, NIH 3T3, Rat 1 fibroblasts, 
yeast, insect cells, such as Sf9 (S. frugipeda) and Drosophila, bacteria, such as 
E. coli, Streptococcus, bacillus, yeast, fungal cells, plants, embryonic stem cells 
(e.g., mammalian, such as mouse or human), cancer or tumor cells. Sf9 
expression can be accomplished in analogy to Graziani et al., Oncogene, 7:229- 
235, 1992. Expression control sequences are similarly selected for host 
compatibility and a desired purpose, e.g., high copy number, high amounts, 
induction, amplification, controlled expression. Other sequences which can be 
employed include enhancers such as from SV40, CMV, inducible promoters, 
cell-type specific elements, or sequences which allow selective or specific cell 
expression. 

In addition to a Rac-GEF nucleic acid, another gene of interest can be 
mtxoduced into the same host for purposes of, e.g., modulating expression 
Rac-GEF, elucidating Rac-GEF function or that of the gene of interest. Genes 
of mterest include other oncogenes, genes involved in the cell cycle, etc. Such 
genes can be the normal gene, or a variation, e.g., a mutation, chimera, 
polymorphism, etc. 

A nucleic acid or polypeptide of the present invention can be used as a 
size marker in nucleic acid or protein electrophoresis, chromatography, etc. 
Defined restriction fragments can be determined by scanning the sequence for 
restriction sites, calculating the size, and performing the corresponding 
restriction digest. For example, the Rac-GEF polypeptide from fetal brain can 
also be used as a molecular weight marker of about 74.7 kDa for a protein gel. 

Another aspect of the present invention relates to the regulation of 
biological pathways in which a GTPase is involved, particularly pathological 
conditions, e.g., cell proliferation (e.g., cancer), growth control, 
morphogenesis, stress fiber formation, and integrin-mediated interactions, 
such as embryonic development, tumor cell growth and metastasis, 
programmed cell death, hemostasia leucocyte homing and activation, bone 
resorption, clot retraction, and the response of cells to mechanical stress. See, 
e.g., Clark and Brugge, Science, 268:233- 239, 1995; Bussey, Science, 272:225- 
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226, 1996. Thus, the invention relates to all aspects of a method of modulating 
an activity of a Rac polypeptide comprising, administering an effective 
amount of a Rac-GEF polypeptide or a biologically-active fragment thereof, 
an effective amount of a compound which modulates the activity of a Rac 
polypeptide, or an effective amount of a nucleic acid which codes for a Rac- 
GEF polypeptide or a biologically-active fragment thereof. The activity of Rac 
which is modulated can include: GTP binding, GDP binding, GTPase activity, 
integrin binding, coupling or binding of Rac to receptor or effector-like 
molecules (such as integrins, growth factor receptors, tyrosine kinases, PI-3K, 
PIP-5K, etc.). See, e.g., Clark and Brugge, Science, 268:233-239, 1995. The 
activity can be modulated by increasing, reducing, antagonizing, promoting, 
etc. of Rac. The modulation of Rac can be measured by assay for GTP 
hydrolysis, binding to Rac-GEF, etc. An effective amount is any amount 
which, when administered, modulates the Rac activitv. The activitv can be 
modulated in a cell, a tissue, a whole organism, in situ, in vitro (test tube, a 
solid support, etc.), in vivo, or in any desired environment. 

Compounds that regulate the interaction between a GEF, such Rac- 
GEF, and a GTPase can be identified using an assay for a GEF activity, such as 
guanine nucleotide exchange activity, binding to a guanine nucleotide- 
depleted site of a GTPase, or oncogenic transforming activity, or a GTPase 
activity such as GTP hydrolysis. In general, a compound having such an in 
vitro activity will be useful in vivo to modulate a biological pathway associated 
with a GTPase, e.g., to treat a pathological condition associated with the 
biological and cellular activities mentioned above. By way of illustration, the 
ways in which GEF regulators can be identified are described above and 
below in terms of Rac and Rac-GEF. However, it is to be understood that 
such methods can be applied generally to other GEFs. 

A guanine nucleotide exchange assay, e.g., as described in Hart et al., 
Nature, 354:311-314, 28 Nov. 1991 (see, especially, Figure 2 legend therein), can 
be used to assay for the ability of a compound to regulate the interaction 
between Rac and Rac-GEF. For example, Rac protein (recombinant, 
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recombinant fusion protein, or isolated from natural sources) is labeled with 
tntiated-GDP. The tritiated-GDP-labeled Rac is then incubated with Rac-GEF 
and GTP under conditions in which nucleotide exchange occurs. The amount 
of trihated-GDP that is retained by Rac is determined by separating bound 
GDP from free GDP, e.g., using a BA85 filter. The ability of a compound to 
regulate the interaction can be determined by adding the compound at a 
desired time to the incubation (e.g., before addition of a Rac-GEF, after 
addition of a Rac-GEF) and determining its effect on nucleotide exchange 
Vanous agonist and antagonists of the interaction can be identified in this 
manner. For instance, an aspect of the instant invention is the discovery that 
certain compounds greatly enhance the activity of Rac-GEFs, and preferrablv 
of the Rac-GEF, Tiam-1. Such compounds are hereinafter termed "GEF 
enhancers." Such compounds have certain similar chemical features including 
a hydrocarbon arm, preferrably consisting of substantially saturated bonds 
that hnk the carbon residues together, and also preferrably the number of 
carbon atoms should be between 12-22. A second feature of such compounds 
is the association of the hydrocarbon arm to either a 5 or 6 membered ring 
structure. Preferred 5 or 6 membered compounds include ascorbate and 
certain cyclohexanes, respectively. The more preferred 5 membered 
compounds are derivatives of ascorbate, while the more preferred 
cyclohexanes include insoitol. 

Binding to a guanine nucleotide-depleted site of Rac can be determined 
« various ways, e.g., as described in Hart et aJ., J. Biol. Chen,, 269:62-65, 1994 
Bnefly, a Rac protein can be coupled to a solid support using various methods 
that one skilled in the art would know, e.g., using an antibody t0 ^ a fusion 
protem between Rac and a marker protein, such as glutathione protein (GST) 
wherein the fusion is coupled to a solid support via the marker protein (such 
as glutathione beads when GST is used), etc. The Rac protein is converted to 
a guanine nucleotide depleted state (for effective conditions, see, e.g., Hart et 
al., J. Biol. Chem., 269:62-65, 1994) and incubated with, e.g., GDP, GTPyS, and 
a GEF such as Rac-GEF. The solid support is then separated and any protein 
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on it run on a gel. A compound can be added at any time during the 
incubation (as described above) to determine its effect on the binding of the 
GEF to Rac. 

The modulation of oncogenic transforming activity by a Rac-GEF, or 
5 derivatives thereof, can be measured according to various known procedures, 

e.g., Eva and Aaronson, Nature, 316:273-275, 1985; Hart et al., J. Biol. Chem., 
269:62-65, 1994. A compound can be added at any time during the method 
(e.g., pretreatment of cells; after addition of GEF, etc.) to determine its effect 
on the oncogenic transforming activity of Rac-GEF. Various cell lines can also 
10 be used. 

Other assays for Rac-mediated signal transduction can be accomplished 
according to procedures known in the art, e.g., as described in U.S. Pat. Nos. 
5,141,851; 5,420,334; 5,436,128; and 5,482,954; W094/16069; W093/16179; 
W091/15582; WO90/00607. In addition, peptides which inhibit the 

15 interaction, e.g., binding, between Rac-GEF and a G-protein, such as Rac, can 

be identified and prepared according to EP 496 162. 

The present invention also relates to a method of testing for and 
identifying an agent which modulates the guanine nucleotide exchange 
activity of a guanine nucleotide exchange factor, or a biologically-active 

20 fragment thereof, or which modulates the binding between a Rac-GEF, or a 

biologically-active fragment thereof, and a GTPase, or a biologically-active 
fragment thereof, to which it binds. The method comprises contacting the GEF 
and GTPase with an agent to be tested and then detecting the presence or 
amount of binding between the GEF and GTPase, or an activity of the GEF 

25 such as guanine nucleotide exchange activity. By modulating, it is meant that 

addition of the agent affects the activity or binding. The binding or activity 
modulation can be affected in various ways, including inhibiting, blocking, 
preventing, increasing, enhancing, or promoting it. The binding or activity 
affect does not have to be achieved in a specific way, e.g., it can be 

30 competitive, noncompetitive, allosteric, sterically hindered, via cross-linking 

between the agent and the GEF or GTPase, etc. The agent can act on either the 
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GEF or GTPase. The agent can be an agonist, an antagonist, or a partial 
agonist or antagonist. The presence or amount of binding can be determined 
in various ways, e.g., directly or indirectly by assaying for an activity 
promoted or inhibited by the GEF, such as guanine nucleotide exchange, GTP 
hydrolysis, oncogenic transformation, etc. Such assays are described above 
and below, and are also known in the art. The agent can be obtained and/or 
prepared from a variety of sources, including natural and synthetic. It can 
comprise, e.g., amino acids, lipids, carbohydrates, organic molecules, nucleic 
acids, inorganic molecules, or mixtures thereof. See, e.g., Hoeprich, Nature 
Biotechnology, 14:1311-1312, 1996, which describes an example of automated 
synthesis of organic molecules. The agent can be added simultaneously or 
sequentially. For example, the agent can be added to the GEF and then the 
resultant mixture can be further combined with the GTPase. The method can 
be carried out in liquid on isolated components, on a matrix (e.g., filter paper, 
nitrocellulose, agarose), in ceils, on tissue sections, etc. In accordance with the 
method, a GEF can bind to the GTPase, which binding will modulate some 
GTPase activity. For example, as discussed above and below, a Rac-GEF binds 
to Rac, causing guanine nucleotide dissociation. The effect can be directly on 
the binding site between the GEF and GTPase, or it can be allosteric, or it can 
be on only one component (e.g., on the GEF only). Assays for guanine 
nucleotide dissociation can be readily adapted to identify agents which 
regulate the activity of a GTPase. The method further relates to obtaining or 
producing agents which have been identified according to the above- 
described method. 

The present invention also relates to products identified in accordance 
with such methods. Various GEFs and GTPases can be employed, including, - 
Rac-GEF, mSOS, SOS, C3G, lsc, Dbl, Dbl-related proteins, polypeptides 
comprising one or more DH domains, CDC24, Tiam-1, Ost, Lbc, Vav, Ect2, 
Bcr, Abr, Rho (A, B, and C), Rac, Ras, CDC42, chimeras thereof, biologically- 
active fragments thereof, muteins thereof, etc. 
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The present invention thus also relates to the treatment and prevention 
of diseases and pathological conditions associated with Rac-mediated signal 
transduction, e.g., cancer, diseases associated with abnormal cell proliferation. 
For example, the invention relates to a method of treating cancer comprising 
5 administering, to a subject in need of treatment, an amount of a compound 

effective to treat the disease, where the compound is a regulator of Rac-GEF 
gene or polypeptide expression. Treating the disease can mean, delaying its 
onset, delaying the progression of the disease, improving or delaying clinical 
and pathological signs of disease. Similarly, the method also relates to treating 

10 diseases associated with inflammation, and/or the chemotactic ability of 

neutrophils. A regulator compound, or mixture of compounds, can be 
synthetic, naturally-occurring, or a combination. A regulator compound can 
comprise amino acids, nucleotides, hydrocarbons, lipids, polysaccharides, etc 
A regulator compound is preferably a regulator of Rac-GEF, e.g., inhibiting or 

15 increasing its mRNA, protein expression, or processing, or its interaction with 

Rac, e.g., guanine nucleotide exchange. Additionally, cells can be 
supplemented with Rac-GEF, or derivatives thereof. To treat the disease, the 
compound, or mixture, can be formulated into pharmaceutical composition 
comprising a pharmaceutical^ acceptable carrier and other excipients as 

20 apparent to the skilled worker. See, e.g., Remington 's Pharmaceutical Sciences, 

Eighteenth Edition, Mack Publishing Company, 1990. Such composition can 
additionally contain effective amounts of other compounds, especially for 
treatment of cancer. 

The present invention also relates to antibodies which specifically 

25 recognize a Rac-GEF polypeptide. Antibodies, e.g., polyclonal, monoclonal, 

recombinant, chimeric, can be prepared according to any desired method. For 
example, for the production of monoclonal antibodies, a polypeptide 
according to Fig. 1 (SEQ ID NO: 2), can be administered to mice, goats, or 
rabbit subcutaneously and /or intraperitoneally, with or without adjuvant, in 

30 . an amount effective to elicit an immune response. The antibodies can also be 

single chain or FAb. The antibodies can be IgG, subtypes, IgG2a, IgGl, etc. 
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An antibody specific for Rac-GEF means that the antibody recognizes a 
defined sequence of amino acids within or including the Rac-GEF amino acid 
sequence of Fig. 1 (SEQ ID NO: 2). Thus, a specific antibody will bind with 
higher affinity to an amino acid sequence, i.e., an epitope, found in Fig. 1 (SEQ 
ID NO: 2) than to a different epitope(s), e.g., as detected and/or measured by 
an immunoblot assay. Thus, an antibody which is specific for an epitope of 
Rac-GEF is useful to detect the presence of the epitope in a sample, e.g., a 
sample of tissue containing Rac-GEF gene product, distinguishing it from 
samples in which the epitope is absent. Such antibodies are useful as 
described in Santa Cruz Biotechnology, Inc., Research Product Catalog, can be 
formulated accordingly, e.g., 100 ug/ml. 

In addition, ligands which bind to a Rac-GEF polypeptide according to 
the present invention, or a derivative thereof, can also be prepared, e.g., using 
synthetic peptide libraries, or nucleic acid ligands (e.g., Pitrung et al„ U.S. Pat. 
No. 5,143,854; Geysen et al., 1987, J. Immunol. Methods, 102:259-274; Scott et 
al., 1990, Science, 249:386; Blackwell et al., 1990, Science, 250:1104; Tuerk et al., 
1990, Science, 249: 505. 

Antibodies and other ligands which bind Rac-GEF can be used in 
various ways, including as therapeutic, diagnostic, and commercial research 
tools, e.g, to quantitate the levels of Rac-GEF polypeptide in animals, tissues, 
cells, etc., to identify the cellular localization and/or distribution of Rac-GEF, 
to purify Rac-GEF or a polypeptide comprising a part of Rac-GEF, to 
modulate the function of Rac-GEF, etc. Antibodies to Rac-GEF, or a 
derivative thereof, can be used in Western blots, ELISA, immunoprecipitation, 
RIA, etc. The present invention relates to such assays, compositions and kits 
for performing them, etc. 

An antibody according to the present invention can be used to detect 
Rac-GEF polypeptide or fragments thereof in various samples, including 
tissue, cells, body fluid, blood, urine, cerebrospinal fluid. A method of the 
present invention comprises contacting a ligand which binds to a peptide of 
Fig 1. (SEQ ID NO: 2) under conditions effective, as known in the art, to 
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achieve binding, detecting specific binding between the ligand and peptide. 
By specific binding, it is meant that the ligand attaches to a defined sequence 
of amino acids, e.g., within or including the amino acid sequence of Fig 1. 
(SEQ ID NO: 2) or derivatives thereof. The antibodies or derivatives thereof 
5 can also be used to inhibit expression of Rac-GEF or a fragment thereof. The 

levels of Rac-GEF polypeptide can be determined alone or in combination 
with other gene products. In particular, the amount (e.g., its expression level) 
of Rac-GEF polypeptide can be compared (e.g., as a ratio) to the amounts of 
other polypeptides in the same or different sample, e.g., p21, p53, Rb, WT1, 

10 etc. . 

A ligand for Rac-GEF can be used in combination with other 
antibodies, e.g., antibodies that recognize oncological markers of cancer, 
including, Rb, p53, c-erbB-2, oncogene products, etc. In general, reagents 
which are specific for Rac-GEF can be used in diagnostic and/or forensic 

15 studies according to any desired method, e.g., as U.S. Pat. Nos. 5,397,712; 

5,434,050; 5,429,947. 

The present invention also relates to a labelled Rac-GEF polypeptide, 
prepared according to a desired method, e.g., as disclosed in U.S. Pat. No. 
5,434,050. A labelled polypeptide can be used, e.g., in binding assays, such as 

20 to identify substances that bind or attach to Rac-GEF, to track the movement 

of Rac-GEF in a cell, in an in vitro, in vivo, or in situ system, etc. 

A nucleic acid, polypeptide, antibody, Rac-GEF ligand etc., according 
to the present invention can be isolated. The term "isolated" means that the 
material is in a form in which it is not found in its original environment, e.g., 

25 more concentrated, more purified, separated from component, etc. An 

isolated nucleic acid includes, e.g., a nucleic acid having the sequence of Rac- 
GEF separated from the chromosomal DNA found in a living animal. This 
nucleic acid can be part of a vector or inserted into a chromosome (by specific 
gene-targeting or by random integration at a position other than its normal 

30 position) and still be isolated in that it is not in a form which it is found in its 

natural environment. A nucleic acid or polypeptide of the present invention 
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can also be substantially purified. By substantially purified, it is meant that 
nucleic acid or polypeptide is separated and is essentially free from other 
nucleic acids or polypeptides, i.e., the nucleic acid or polypeptide is the 
primary and active constituent. 

The present invention also relates to a transgenic animal, e.g., a non- 
human-mammal, such as a mouse, comprising a Rac-GEF nucleic acid. 
Transgenic animals can be prepared according to known methods, including, 
e.g., by pronuclear injection of recombinant genes into pronuclei of 1-cell 
embryos, incorporating an artificial yeast chromosome into embryonic stem 
cells, gene targeting methods, embryonic stem cell methodology. See, e.g., 
U.S. Patent Nos. 4,736,866; 4,873,191; 4,873,316; 5,082,779; 5,304,489; 5,174,986; 
5,175,384; 5,175,385; 5,221,778; Gordon et al., Proc. Natl. Acad. ScL, 77:7380-7384 
(1980); Palmiter et al., Cell, 41:343-345 (1985); Palmiter et al., Ann. Rev. Genet, 
20:465-499 (1986); Askew et al., Mol. Cell. Bio., 13:4115-4124, 1993; Games et al. 
Nature, 373:523-527, 1995; Valancius and Smithies, Mol. Cell. Bio., 11:1402-1408, 
1991; Stacey et al., Mol. Cell. Bio., 14:1009-1016, 1994; Hasty et al., Nature, 
350:243-246, 1995; Rubinstein et al., Nucl. Acid Res., 21:2613-2617,1993. A 
nucleic acid according to the present invention can be introduced into any 
non-human mammal, including a mouse (Hogan et al., 1986, in Manipulating 
the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York), pig (Hammer et al., Nature, 315:343-345, 1985), 
sheep (Hammer et al., Nature, 315:343-345, 1985), cattle, rat, or primate. See 
also, e.g., Church, 1987, Trends in Biotech. 5:13-19; Clark et al., 1987, Trends in 
Biotech. 5:20-24; and DePamphilis et al., 1988, BioTechniques, 6:662-680. In 
addition, e.g., custom transgenic rat and mouse production is commercially 
available. These transgenic animals are useful as a cancer model, e.g., to test 
drugs. 

Generally, the nucleic acids, polypeptides, antibodies, etc. of the present 
invention can be prepared and used as described in U.S. Pat. Nos. 5,501,969; 
5,506,133; 5,441,870; WO 90/00607; and WO 91/15582. 
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For other aspects of the nucleic acids, polypeptides, antibodies, etc., 
reference is made to standard textbooks of molecular biology, protein science, 
and immunology. See, e.g., Davis et al. (1986), Basic Methods in Molecular 
Biology, Elsevir Sciences Publishing, Inc., New York; Hames et al. (1985), 
5 Nucleic Acid Hybridization, IL Press, Molecular Cloning. Sambrook et al.: Current 

Protocols in Molecular Biology. Edited by F.M. Ausubel ei al.. John Wiley & Sons. 
Inc: Current Protocols in Human Genetics. Edited by Nicholas C. Dracopoli et al.. 
John Wiley & Sons. Inc.: Current Protocols in Protein Science: Edited by John E. 
Coligan et al.. John Wiley & Sons. Inc.: Current Protocols in Immunology: Edited by 
10 John E. Coligan et al.. John Wiley & Sons. Inc: 

EXAMPLES 
Example 1 
Cloning of cDNA encoding Rac GEF 

A Dbl-homology domain containing protein was identified in a human fetal 
15 brain cDNA library as follows. A TBLASTN search of the dbEST database 

was performed using the amino acid sequence (residues 1-519) encoded by the 

human TIM protein (Chan et al., 1994, Oncogene, Vol. 9, pages 1057-1063). 

One EST clone, # 167059 was identified with high sequence homology to the 

TIM cDNA. The plasmid encoding this insert was purchased via the 
20 LM.A.G.E. Consortium (Research Genetics). Using this cDNA as template, a 

511-bp 3: P-labelled PCR product was produced using oligos 

5'-GGAGGCCATGTTCGAGCTGG-3' and 

5'- GCTGATCATCTGTTCCGTGC-3' (5' and 3' primers, respectively) and *P 
labelled nucleotides. This labeled PCR fragment was used as a probe to screen 

25 approximately 4 x 10* clones of a human fetal brain Lambda ZAP cDNA 

library (Stratagene). A clone with an insert of 2.6-kb was isolated, and the 
complete DNA sequence of this clone was determined using an ABI sequencer. 
This 2.6-kb clone harbored a single open reading frame of 1950-bp that is 
predicted to encode a 650-amino acids protein with a calculated molecular 

30 mass of 74.7 kDa. However, this open reading frame is not full-length, as the 

initiating methionine is missing. This cDNA is on deposit with the American 
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Type Culture Collection, December 11, 1996, with the Accession No.98273, 
and is denoted p67 Rac-GEF. 

Northern analysis using the probe described above was conducted. The 
results revealed a 3.5 kb transcript specific to brain tissue and an additional 4 
5 kb transcript of lower abundance specific to liver tissue. Other normal tissues 

tested, including heart, placenta, lung, muscle, kidney, pancreas, spleen, 
thymus, prostate, testis, ovary, intestine, colon and peripheral blood 
lymphocytes were also essentially negative. In a preliminary screen of human 
tumor cell lines, abundant 3.5 kb mRNA levels were detected in the lung 
1 0 carcinoma cell line A549 and the colon carcinoma cell line SW480. Other 

tumor cell lines were negative, including HL-60, HeLa, K-562, Molt-4, Raji and 
G-36. Further screening of a number of primary tumor samples revealed over- 
expression in liver, lung and colon tumors. 

Using the additional sequence identified in the 2.6-kb clone, further 
15 analysis of the dbEST database using the Blastn program identified an 

additional clone, # 109922, which had been isolated from a liver library. The 
plasmid encoding this insert was purchased via the I.M.A.G.E. Consortium 
(Genome Systems), and the sequence of the insert was determined. This 
sequence revealed an initiating methionine and 126 additional amino acids 
20 which differed from the amino-terminal 66 amino acids of the 2.6 kb brain 

clone described above. This new sequence most likely encodes the liver- 
specific alternatively spliced form which had been identified by Northern 
analysis. Pieced together with the previously determined sequence, this liver- 
derived sequence reveals an open reading frame of 2133-bp predicted to 
25 encode a 710-amino acid protein. 

In addition to the alternatively-spliced brain/liver isoforms, another 
putative splice variant was identified: an insertion of 72-bp coding for 24- 
amino acids within the Dbl homology region is encoded by the 2.6-kb brain 
clone. The sequence encoded by these 24-amino acids is conserved among 
other exchange factors including Tim (Chan et al., 1994, Oncogene, Vol. 9, 
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pages 1057-1063) and Vav2 (Henske et al., 1995, Ann Hum Genet 59, Pt. 1, 
pages 25-37). 

Example 2 
Properties of Rac GEF 

5 Two Rac-GEFs were tested for guanine nucleotide activity. 

Firstly, a Glu-epitope tag (MEYMPMEIRHD) was engineered onto the 
carboxy-terminal 423 amino acids of the Rac GEF encoded by EST No. 167059 
by introducing the oligos 

5'-TCGAGGAGGTTATAAATATGGAATACATGCCAATGGA-3' and the 
10 complementary 

5'-AATTTCCATTGGCATGTATTCCATATTTATAACCTCC-3' into the 
XhoI/EcoRI sites of the clone. The protein encoded by this construct is 
referred to as Type I Rac-GEF. 

Next, the sequence encoding the insertion in the Dbl homology region, 

15 as described in Example 1, was engineered into the open reading frame in the 

expression plasmid pET21a (Novagen). The protein encoded by this construct 
is referred to as Type II Rac-GEF. The resulting expression plasmids were 
introduced into E. coli strain BL21(DE3)pLysS (Novagen), and the epitope 
tagged protein expression was induced with IPTG. The expressed proteins 

20 were purified using a resin with the antibody to the Glu-epitope covalently 

attached. The resulting proteins were partially pure and were assayed for 
exchange activity on Racl, RhoA and Cdc42. See, Hart, in U.S. Serial No. 
60/029,979, filed November 6, 1996. The results showed that Rac GEF is 
primarily selective for Racl, but also displays activity against both RhoA and 

25 Cdc42. Furthermore, the Type I form lacking the Dbl insert region is 

unaffected by the addition of the PH domain ligand ascorbyl stearate, while 
the Type II form containing the Dbl insert region is strongly stimulated by 
ascorbyl stearate. 
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Examp le 3 
Lmmunochenii cal detection 
Antibody specific to Rac-GEF was raised in rabbits against three 
fragments of the purified recombinant molecule. The fragments correspond to 
amino acids 385-398, Type II and amino acids, 372-386 of Type I Rac-GEF 
referred to in Example 2, and 693-710 amino acids of Type U. The peptides 
were coupled to KLH, and antibody raxsed in rabbits using standard 
procedures. 

Examp le 4 

Cloning and Frpression of Tiam-1 ,nH Truncations TWnf 

Cloning and expression of Tiam-1, and various Tiam-1 truncations, is 
described below. This work, and that shown in Examples 5 and 6, was 
undertaken to determine those regions of Tiam-1 that realize GEF enhancer 
stimulation of Rac GEF activity. 

cDNA Cloninp Of Human Tiam-1 ,nH Ti^.i T...^ tirn _. Primers designed 
against published mouse Tiam-1 cDNA sequence (See, Habets, G.G, Scholtes, 
E.H., Zuydgeest, D., van der Kammen, R.A., Stam, J.C., Berns, A., and Collard, 
J.G (1994) Cell 77, 537-549; NCBI Gen Bank Accession #U05245) were used in 
PCR reactions using a human fetal brain library (Stratagene #936206) as 
template to obtain fragments of the human Tiam-1 gene which were 
radiolabled and used as probes in Southern hybridizations of the same library. 
Primer pairs used were both 

5'-CCATAAAACCATGGGAAACGC-3' and 
5'-GGTTCCGCGGAAGAGAAGGAT-3' with 
25 5'-GACTGGCCCGGGGAACTGAGG-3'; and 

5'-TCGGATGCGGATAAGCTGCGC-3' with 

5'-GTGACTGGCGACCTTGTTCAT-3'. Two partial clones of human Tiam-1 
cDNA were retrieved, one contained nucleotides (nt) 1-2972 and the other (nt) 
2972-4657 (numbering throughout corresponds to previously published Tiam- 
1 cDNA (See, Habets, G.G, van der Kammen, R.A., Stam, J.C, Michiels, F., 
and Collard, J. G (1995) Oncogene 10, 1371-1376; NCBI Gen Bank Accession 
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#U16296). To obtain missing C-terminal sequences, a PCR reaction employing 
oligonucleotides designed against the human Tiam-1 cDNA (See, Habets, 
G.G., van der Kammen, R.A., Stam, J.C., Michiels, F., and Collard, J. G. (1995) 
Oncogene 10, 1371-1376; NCBI Gen Bank Accession #U16296) 5'- 
5 CGGAATTCAGATTTCGACACATGATC-3' (sense) and 

5'-TCGCCCGGGGCAGGTGACGCAGTCAGA-3' (antisense, contains Smal 
site downstream of stop codon) as primers and a human hippocampal library 
(Clontech #HL3023b) as template produced a fragment containing nt 4458-5366 
which was added to existing clones using the internal Eco47III (4487) site. A 
10 similar strategy using the antisense primer 

5'-GATCCCGGGTCATGTTTCTGGTTCTGGGATCTCAGTGTTCAGTTTCCTG-3' 
was used to add the KT3 epitope tag "PEPET," a stop codon, and a Smal site to 
the end of Tiam-1. To splice the two partial clones together, a PCR reaction 
using 5'-CGGAATTCCATGGGCCGCCTTGGAATCT-3' (sense) and 5'- 

15 TCGCCCGGGCGTCAGCAGCACGATTAT-3' 

(antisense) as primers, and a human fetal brain cDNA library (Clontech 
#HL50156) as template produced a product spanning nt 2422-3189 which was 
cloned into pBS SK+ (Stratagaene #212201) using EcoRI and Smal. Ncol (472) - 
Ncol (2422) and StuI (3134) -Smal fragments were ligated into this vector, 

20 creating full-length clones, with and without the KT3 tag. 

It is note worthy that the isolated Tiam-1 sequence was altered from the 
published sequence. The 5' clone obtained from the library contained an 
insert of sequence 5'- 

GGTGAGCAGTTTACACTTTCATATACTCCCTGTCATGTGCTTTGAAGGACTTTC 
25 T AGGGGC ATC A AG-3 ' in the upstream non-coding region at nt 105. Original 

clones from the Stratagene fetal brain library as well as all PCR products from 
Clontech hippocampal and brain libraries contained a difference in sequence 
from the published Tiam-1 cDNA ((See, Habets, G.G., van der Kammen, R.A., 
Stam, J.C., Michiels, R, and Collard, J. G. (1995) Oncogene 10, 1371-1376; NCBI 
30 Gen Bank Accession #U16296); a G at nt 3005 instead of a C, which therefore 
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encodes a Gin in position 844 instead of a His. In addition, PCR introduced 
silent mutations G4739A and G5153A. 

Example .5 
Expression nf Tiam-1 and Truncations 
The following expression vectors were constructed and used to express 
the appropriate Tiam-1 constructs. 

Full-length 078 KB) Tiam-V A KT3-tagged 4792 basepair (bp) NcoI (472)- 
Smal fragment was ligated into NcoI-Smal-digested P AcC4 (See, Rubinfeld, 
B., et al. Cell 65, 1033-1042 (1991)): Bio /Technology 6:47-55 (derived from 
pAc436)). 

135 kD Tiam-1: The 5' phosphorylated oligonucleotides 

5'-GTC ATG ATGG-3 ' and 5'-TCCATCATGACGGCC-3' were used as linkers to 
recircularize Apal - EcoNI (1673)-di g ested pBS SK + -ba Se d full-length Tiam-1. 
The linker-created BspHI site and the vector-derived Spel site were used to 
clone the 4006 bp fragment into Ncol-X bal-digested P AcC4 (See, Rubinfeld, 
B., et al. Cell 65, 1033-1042 (1991)). 

l06kDTlam - 1: The Nco1 < 472 ) - NcoI (2422) fragment was removed from the 
full-length pAcC4-based expression vector. 

85 kD Tiam-1: PCR using 

5'-CTTGAATTCCACCATGGAAATCTGTCCAAAAGTCACT-3' (sense) and 
5'-TCGCCCGGGCGTCAGCAGCACGATTAT-3' (antisense) as primers and 
the Stratagene Tiam-1 nt 2972-4657 clone as template was used to create an 
NcoI-StuI (3134) fragment that placed an ATG before nt 2972. The 2297 bp 
Ncol-Smal was ligated into NcoI-Smal-digested P AcC4 (See, Rubinfeld, B., et 
al. Cell 65, 1033-1042 (1991)). 
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66 kD Tiam-1: The 5' phosphorylated oligonucleotides 

5'-CATGGACCAGAACCCATCTCC-3' and 5'-TGAGGAGATGGGTTCTGGTC-3' 
were used as linkers to recircularize Ncol (472) - Bsu36I (3534)-digested pBS 
SK+-based full-length Tiam-1. The linker-regenerated Ncol site and the 
5 vector-derived Spel site were used to clone the 1761 bp fragment into Ncol- 

Xbal-digested pAcC4 (See, Rubinfeld, B., et al. Cell 65, 1033-1042 (1991)). 

APH versions of Tiam-1 : The oligonucleotides 

5'-GCCAGAACCAGAAACATGAC-3' and 

5'-CCGGGTCATGTTTCTGGTTCTGGC-3' were used as linkers to 
10 recircularize Eco47III (4487) and Xmal-digested pAcC4-based expression 

vectors containing the 135 kD, 106 kD, 85 kD, and 66 kD versions of Tiam-1. 
These primers also restored the KT3 tag. 

GST-PH domain fusion proteins: Products from PCR reactions using Tiam-1 
cDNA as template and 

15 5'-GAGGAATTCGATCTGAGCATGGGAGACCTG-3' and 

5'-CTGCTCGAGCTACTTATCACGCAGGATTGAATG-3' (C-terminal PH 
domain) or 5'-CAGGAATTCGTGCGCAAGGCCGGCGCCCTG-3' and 
5'-GTGCTCGAGCTACGCAGTGGCGCAGGCAGAGTG-3' (N -terminal PH 
domain) as primers were cloned into pGEX20T (See, Helin, K., Harlow, E. and 

20 Fattaey, A. (1993) Mol. Cell. Biol. 13, 6501-6508) using EcoRI and Xhol. 

All Tiam-1 constructs, except for the GST fusions, were produced in 
baculoviurs-infected S. frugiperda -9 cells and were purified using KT3-mAb 
immunoaffinity chromatography. See, Schreurs, J., Yamamoto, R., Lyons, J., 
Munemitsu, S., Conroy, L., Clark, R., Takeda, Y., Krause, J.E., and Innis, M. 
25 (1995) /. Neurochem. 64, 1622-1631. GST fusion proteins were produced in 

E.coli and purified using glutathione-agarose. See, Smith, D.B. and Johnson, 
K.S, (1988) Gene 67, 31-40. 
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Example 6 
Stimulator s of Tiam-1 R ac GEF Activity 

The GEF activity of the various Tiam-1 constructs described above was 
determined in the presence and absence of certain compounds. The following 
assay was utilized. Reactions were conducted at room temperature in Buffer 
A (20mM Hepes pH7.3, 50mM NaCl, 2mM DTT, 2mM MgC12). All proteins 
and compounds were diluted to 4x their final concentrations in Buffer A 
(GTPases were diluted in Buffer A containing luM GDP). To dilute Ascorbyl 
Stearate, Ascorbyl Palmitate, and Stearic Acid, 25mM EtOH solutions were 
slowly added to Buffer A while vortexing vigorously. Other lipids were 
resuspended in aqueous solution with vortexing and bath sonication and then 
diluted into Buffer A. Reactions were prepared and at time 0, \-"P-GTP 
(DuPont NEN #NEG006H) was added to 4.5nM, and after 10 minutes 
reactions were stopped by filtering onto nitrocellulose filters (Millipore 
#HAWP02500) and immediately washed with wash buffer (25mM Tris 7.5, 
lOOmM NaCl, 30mM MgC12). Bound X- 32 P-GTP was measured, using 
standard techniques. 

The 85 kD portion of human Tiam-1 protein was produced in insect 
cells and purified by affinity chromatography, as described above. This 
protein contained an intact PDZ domain, Dbl-homology (DH) domain and 
adjacent pleckstrin homology (PH) domain (Fig. 3). Using the above 
described assay, this truncation alone, at various concentrations, exhibited no 
GEF activity towards Rac 1 (Fig. 4). In contrast, ascorbyl stearate (AS) 
stimulated the rate of Tiam-l-mediated nucleotide exchange on Rac 1 (Fig. 4). 
Because AS has the potential to act as a detergent or a reducing agent, other 
detergents (nOG, Trition X-100, NP40) and reducing agents (DTT, TCEP, or 
Tris (2-carboxyethyl)phosphine) were tested and shown not to significantly 
stimulate Tiam-1 GEF activity. Several other lipids were tested to determine 
the specificity of activation. Ascorbyl palmitate (AP), phosphatidylinositol-4- 
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phosphate (PI(4)P), and phosphatidylinositol-4,5-bisphosphate (PI(4 / 5)P2) 
significantly enhanced Tiam-1 activity; phosphatidylinositol-3,4,5- 
trisphosphate (PI(3,4,5)P3) and phosphatidylserine had weak effects; and 
phosphatidylglycerol, phosphatidylinositol and phosphatidylcholine had little 
or no effect (Fig. 5). As a control, experiments were run to determine if IP3, 
IP2, ascorbic acid, stearic acid and ascorbic acid with stearic acid were 
sufficient to activate the GEF activity of Tiam-1. The results showed that these 
reagents were incapable of stimulating GEF activity (Fig. 5). 

It has been previously reported that Tiam-1 has GEF activity in the 
absence absence of lipids. See, Michiels, F., Habets, G.G., Stam, J.C., van der 
Kammen, R.A., and Collard, J.G. (1995) Nature 375,338-340. Those studies 
used a mouse version containing additional upstream sequences, including the 
N-terminal PH domain and the coiled-coil region. To determine if upstream 
regions are necessary for expression of the DH domain GEF activity, the 
corresponding human construct was prepared (Fig. 3). This 135 kD Tiam-1 
truncation shows weak GEF activity towards Rac in the absence of AS, but is 
still greatly stimulated by AS (Fig. 4). Other truncations of Tiam-1, all 
containing the DH and PH domains (Fig. 3), also exhibited AS stimulated GEF 
activity on Rac 1 (Fig. 6). 

To determine if AS binding to the PH domain was responsible for 
activation of Tiam-1 GEF activity, sequences 3' of the Eco47III site were 
deleted, removing half of the C-terminal PH domain as well as the rest of the 
C-terminus (Fig. 3). These truncations of Tiam-1 were not activated by AS, 
including one that contained the N-terminal PH domain (Fig. 4). While it is 
possible that deleting the PH domain destroys activity of the DH domain 
altogether, similar truncations of the PH domain of the Dbl protein do not 
affect its GEF activity (See, Zheng, Y., Zangrilli, D., Cerione, R.A., and Eva, A. 
(1996). /. Biol Chem. 271, 19017-19020). To further determine if the PH 
domains could bind to AS, GST-PH fusion proteins were included in the 
reaction. While GST alone did.not affect AS-stimulated Tiam-1 exchange 
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activity, both of the Tiam-1 GST-PH domain fusions reduced the effectiveness 
of AS (Fig. 6). 

Without further elaboration, it is believed that one skilled in the art can, 
using the preceding description, utilize the present invention to its fullest 
5 extent. The preceding preferred specific embodiments are, therefore, to be 

construed as merely illustrative, and not limitative of the remainder of the 
disclosure in any way whatsoever. 

The entire disclosure of all patents/patent applications and 
publications, cited above and in the figures are hereby incorporated by 
10 reference in the entirety. 

From the foregoing description, one skilled in the art can easily 
ascertain the essential characteristics of this invention, and without departing 
from the spirit and scope thereof, can make various changes and modifications 
of the invention to adapt it to various usages and conditions. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Bollag, Gideon 
Crompton, Anne 
North, Anne 
Roscoe, William 
10 Sharma, Sanju 

(ii) TITLE OF INVENTION: Methods and Compositions for Treating 

Abnormal Cell Growth Related to Unwanted Guanine Exchange 
^ Factor Activity 

'(iii) NUMBER OF SEQUENCES: 2 

(*V) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ONYX Pharmaceuticals, Inc. 

lv (B) STREET: 3031 Research Drive 

(C) CITY: Richmond 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP : 94806 

25 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

30 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Unknown 

(B) FILING DATE: 17-JUN-1997 

35 (C) CLASSIFICATION: Provisional 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Giotta Ph.D., Gregorv 

(B) REGISTRATION NUMBER: 32,028 

40 (C) REFERENCE/ DOCKET NUMBER: CNYX1028 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: {510} 262-8710 

(B) TELEFAX: (510) 222-9758 
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70 



(2) INFORMATION FOR SEQ ID NO : 1 : 



<i> SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 3171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 7 6.. 2208 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
TCACTCAAAC CAGTGAAGCT TGGGAAAGTC ATTGACCTCC AGTCGTTCTG CTGAGAAACA 60 
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159 



207 



255 



303 



331 



399 



447 



495 



TCTGGCTCTA TTTCC ATG GAG ACC AGG GAA TCT GAA GAT TTG GAA AAG ACC 
Met Glu Thr Arg Glu Ser Glu Asp Leu Glu Lys Thr 

5 S Arg ser Ala K Sn « f° ^ GAT ** T GAA CCA GCG 

15 r Asp Tr P As * Thr Asp Asn Glu Pro Ala 

20 25 

-o JS 52 K S S 2 2 £ Si £ S K S E Si SI 

=| as g s a 2p k s s s k s S S K 

55 60 
AGA AAT TCC ATC TTC AAT CGC TCC ATA AC A rrr a* a 
Arg Asn Ser ne Phe Asn Arg S lie 2£ £ ™ £ £ £ £S 
20 70 75 

Ala S S £n £ Glu £ £ n S S £ T ai* GAT TCA CAG 
80 o? Ser Cys Leu Ala Asp Ser Gin 

90 

5 Asp Asn S£ £ £ Val j£ Su Pro ™ ^ T ATC GCC ™ G 

95 ?i" Pro ^ su Thr Leu Asn He Pro Trp 

ioo 105 

30 £ Met Pro' Pro C^s j£ £ Ala 2° ^ GAC CCA GGA GCG 

110 7 if? Thr Ala Met Gln Thr Asp Pro Gly Ala 

115 120 

CAG GAA ATG AGT GAG TCG TCr Trr- 

35 «. «. „. e s „ «. s E 2? S S S? K £ E S S 

Si SS S K E 2 s S £ S S g - g SS S 5 " 

40 D iso 155 

S Arg Met £ SS £ £ £ £ 2S T T J" **= CK A " 
160 Asp Ser Tr P Ar 9 Asn Leu He 

165 170 

GAA CAA ATA GGG CTC CTG TAT ran m> - 

Glu Gin He Gly Leu Leu Gin *° A ° AT TCG ACT CTC 

175 ^ ?i" Glu - /r Ar 9 As P Lys ser Thr Leu 

180 185 

50 Sn Glu £ Si Thr Arg Ara Gin S° S" OCA GAA ATA GAA GAC AAT 

wu Tnr Arg Arg Gin Gin Asp Ala Glu lie Glu Asp Asn 

195 200 

Tnr £n gS Ser Pro Ala ^ Su 2* 2? CCG GAG GAG GAA GAA «* 
55 205 P THr Pr ° Glu Glu Glu Glu Glu 

215 2 2o 

Su Glu Su Su Su Glu Pro A C a 2* CCA GAG AGG ™ ACT « 

225 Ala SSr P ™ Pro Glu Arg Lys Thr Leu 

OU 23 0 235 

£ Sn £ S SS 2S S A^n S SS sSr f ° ^ ^ " C TCG 
24 0 o^c er Arg Phe Asn Leu Trp 

245 250 

Sn Asp S £ S G £ S A eS £ S; SE S° ATC CTA CAG 

255 ocJ y Val Leu Glu Ile Leu Gin 

260 265 

70 S Su Su G £ 5S Leu Sn Giu £ K S 2° CTG GTC ACT TCC 

270 ^ie Ala Met Phe Glu Leu Val Thr Ser 

275 280 



591 



639 



6S7 



735 



783 



831 



879 
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GAG GCG TCC TAC TAC AAG AGT CTG AAC CTG CTC GTG TCC CAC TTC ATG 975 

Glu Ala Ser Tyr Tyr Lys Ser Leu Asn Leu Leu Val Ser His Phe Met 
285 290 295 300 

5 GAG AAC GAG CGG ATA AGG AAG ATC CTG CAC CCG TCC GAG GCG CAC ATC 1023 

Glu Asn Glu Arg He Arg Lys He Leu His Pro Ser Glu Ala His He 
305 310 315 

in CTC TCC AAC G TC CTG GAC GTG CTG GCT GTC AGT GAG CGG TTC CTC 1071 

1U Leu phe Ser Asn Val Leu Asp Val Leu Ala Val Ser Glu Arg Ph* Leu 

320 325 330 



20 



40 



45 



50 



60 



65 



70 



CTG GAG CTG GAG CAC CGG ATG GAG GAG AAC ATC GTC ATC TCT GAC GTG 1119 
r_ „i„ r„„ Glu clu Asn 

340 345 



t — — ~< iw ™-»v- ftH- VjrVV_ bid 

1S Leu Glu Leu Glu Hls Arg Met Glu Glu Asn Ile Val lie Ser Asp Val 

I j 33 5 *• * " 



TGT GAC ATC GTG TAC CGT TAT GCG GCC GAC CAC TTC TCT GTC TAC ATC 1167 
Cys Asp Ile Val Tyr Arg Tyr Ala Ala Asp His Phe Ser Val Ty- lie 
350 355 360 



ACC TAC GTC AGC AAT CAG ACC TAC CAG GAG CGG ACC TAT AAG CAG CTG 1215 
Thr Tyr Val Ser Asn Gin Thr Tyr Gin Glu Arg Thr Tyr Lys Gin Leu 
365 370 375 380 

25 CTC CAG GAG AAG GCA GCT TTC CGG GAG CTG ATC GCG CAG CTA GAG CTC 1263 

Leu Gin Glu Lys Ala Ala' Phe Arg Glu Leu He Ala Gin Leu Glu Leu 
385 390 395 



GAC CCC AAG TGC AGG GGG CTG CCC TTC TCC TCC TTC CTC ATC CTG CCT 1311 

Asp Pro Lys Cys Arg Gly Leu Pro Phe Ser Ser Phe Leu Ile Leu Pro 
400 405 410 

TTC CAG AGG ATC ACA CGC CTC AAG CTG TTG GTC CAG AAC ATC CTG AAG 1359 

Lys Leu Leu Val Gin Asn 

.420 425 



30 



35 Phe Gln j£| Ile Thr Arg Leu Lys Leu Leu Val Gin Asn Ile Leu Lys 



AGG GTA GAA GAG AGG TCT GAG CGG GAG TGC ACT GCT TTG GAT GCT CAC 1407 
Arg Val Glu Glu Arg Ser Glu Arg Glu Cys Thr Ala Leu Asd Ala His 
430 435 44Q 

AAG GAG CTG GAA ATG GTG GTG AAG GCA TGC AAC GAG GGC GTC AGG AAA 14 55 

Lys Glu Leu Glu Met Val Val Lys Ala Cys Asn Glu Gly Val Arg Lys ' 
445 450 455 4 £ 0 

ATG AGC CGC ACG GAA CAG ATG ATC AGC ATT CAG AAG AAG ATG GAG TTC 1503 
Met Ser Arg Thr Glu Gin Met Ile Ser He Gin Lys Lvs Met Glu Phe 
465 470 475 

AAG ATC AAG TCG GTG CCC ATC ATC TCC CAC TCC CGC TGG CTG CTG AAG 1551 
Lys lie Lys Ser Val Pro He He Ser His Ser Arg Trp Leu Leu Lys 
480 485 490 



CAG GGT GAG CTG CAG CAG ATG TCA GGC CCC AAG ACC TCC CGG ACC CTG 1599 

s er Gly Pro Lys Thr Ser 
500 505 



_, . , * -rvnu ftLL i^L Lbb ftLL Lib 

Gin Gly Glu Leu Gin Gin Met Ser Gly Pro Lys Thr Ser Arg Thr Leu 

J J 495 c nr\ 



AGG ACC AAG AAG CTC TTC CAC GAA ATT TAC CTC TTC CTG TTC AAC GAC 1647 
Arg Thr Lys Lys Leu Phe His Glu Ile Tyr Leu Phe Leu Phe Asn Asp 
510 515 520 

CTG CTG GTG ATC TGC CGG CAG ATT CCA GGA GAC AAG TAC CAG GTA TTT ' 1695 
Leu Leu Val Ile Cys Arg Gin lie Pro Gly Asp Lys Tvr Gin Val Phe 
525 53 0 535 540 

GAC TCA GCT CCG CGG GGA CTG CTG CGT GTG GAG GAG CTG GAG GAC CAG 1743 
Asp Ser Ala Pro Arg Gly Leu Leu Arg Val Glu Glu Leu Glu Asp Gin 
545 550 555 

G ? C C ? G CTG GCC GTG TTC ATC CTG CGG CTG CTG GAG AAC GCA 1791 

Gly Gin Thr Leu Ala Asn Val Phe Ile Leu Arg Leu Leu Glu Asn Ala 
560 565 ~ 570 
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s s s as s s 5 s s js - e nr s: 2; as »» 

580 585 
5 ATG AAG CGT TGG ATG ACC TCA PTT rrr rrr * *o fc 

»« ». *, Tn. h.« Tta g 2 s 2 i£ 2 £ S K S "■' 

600 

S 2 S S S 2 2 2 2 S | 2 2 2 S 2 - 



15 



55 



65 



70 



S S S - SI 2 ffi S 2 S 2S2 2 2 E ~» 

" = a S 2 2 £ s 2 s sgsSs - 



K 5 i S 2 2 2 2 S S S S £ E 2 S 

660 665 

25 ss s Asn k s s s- ss r r gaa tct 2127 

670 c=7c 9 Ser Gin Asn Leu L y s Glu Cys 

675 680 

30 2 2 5£ 2 2 « « « - « - - - - « „s 

695 700 

,, S S £S 2 S £ 2 S2 2 2 ^ a CCK " K " — «~ «• 



710 

GGGAGCAGGG CCTGCATGAG ACCCCGACAG AAGGTGGGGG GGGGGGGGGG GGCTCTGGGA 
4Q AGCACAGGCC AGCACCTCCC CAGGTGGCAG GATCTGGCTT GGGGTGCCCG GCCCTCATCC 

CTGCCCACGC AGTGAGTGCT CATGTGTCTT GGCCCCTTGC TCGCAAACTG GATAAAGGGT 
GCCCAAGCCT CTCCTGATGC ATTTGTAAAC AAGAAGGTTT CAGCAGTATT ACACCACCTC 
45 CCTCATGCCT CCGAGGGGGT GGAAGGGGGT GGGCACACTC CAGGGCCCCC CATGCCCCTG 

GCCCCCAGGG ATTGGAAGAG GCTCCCAACC CAGAGTGTCC CTG TGGGAGG CAGGCAGAAG 
5Q GTGACAATTG ACACGATTTC CTGCACGCGT CTTCTTTTAC CTTGGAAGCA GTTAGAATTT 

ACCAGGCACA GATGAGGCCG CCCTTGCCTG ACGGAGCTTG ATGAGCAGCC CTTGGTCTCC 
GGTTCCAGGA CTGAGAGCCC AGCTGCCTCT GCCCACCCTT CCCCAGGCCT CTCCCAGCCT 
C^GCTGCAC GGTCAGGCCC TGCCCCATCG CAGGCCTGCC AGAGCTTGGC TGGGGACCCC 
TCCCGCCTCT GGCTCCCTGA TGGGCTGGAT GTAACTTGTG TCTTCTAGCC CCTTAAGGAG 
6Q CCCAGGTGTT TTAAGGAATG AATTGGTCAC TGCATCTTGT ATCGATTATG GTTCTGAGAA 

AAGCAAATAT CGGAATTCCT GCAGCCCGGG AAATGGGGCC ACGCCCGAGG AGTGGCCGGC 
CCTCGCCGAC AGCCCCACCA CGCTCACCGA GGCCCTGCGG ATGATCCACC CCATTCCCGC 
CGACTCCTGG AGAAACCTCA TTGAACAAAT AGGGCTCCTG TATCAGGAAT ACCGAGATAA 
ATCGACTCTC CAAAAAAAAA AAAAAAAAAA GATCTTTAAT TAA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

42 
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2408 

2468 

2523 

2588 

2643 

2708 

2768 

2828 

2888 

2948 

3008 

3068 

3128 

3171 
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(A) LENGTH: 711 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met G * u Arg Glu Ser Glu Asp Leu Glu Lvs Thr Arg Arg Lvs Ser 

10 1 5 io ' i 5 

Ala Ser Asp Gin Trp Asn Thr Asp Asn Glu Pro Ala Lys Val Lvs Pro 
20 25 30 

15 Glu Leu Leu Pro Glu Lys Glu Glu Thr Ser Gin Ala Asp Gin Asp He 

35 40 45 

Gin Asp Lys Glu Pro His Cys His He Pro He Lys Arg Asn Ser He 
50 .55 60 

Phe Asn Arg Ser He Arg Arg Lys Ser Lys Ala Lys Ala Arg Asp Asn 
65 70 75 80 

oc Pro Glu Arg Asn Ala Ser Cys Leu Ala Asp Ser Gin Asp Asn Glv Lys 

25 85 90 95 

Ser Val Asn Glu Pro Leu Thr Leu Asn He Pro Trp Ser Arg Met Pro 
100 10 5 no 

Pro Cys Arg Thr Ala Met Gin Thr Asp Pro Gly Ala Gin Glu Met Ser 
115 120 125 

Glu Ser Ser Ser Thr Pro Gly Asn Gly Ala Thr Pro Glu Glu Trp Pro 
130 135 140 

Ala Leu Ala Asp Ser Pro Thr Thr Leu Thr Glu Ala Leu Arg Met He 
145 150 155 160 

An His Pro Iie Pro Ala Asp Ser Trp Arg Asn Leu He Glu- Gin He Gly 

4U 165 . 170 175 

Leu Leu Tyr Gin Glu Tyr Arg Asp Lys Ser Thr Leu Gin Glu He Glu 
180 185 190 

Thr Arg Arg Gin Gin Asp Ala Glu He Glu Asp Asn Thr Asn Glv Ser 
195 200 205 

Pro Ala Ser Glu Asp Thr Pro Glu Glu Glu Glu Glu Glu Glu Glu Glu 
210 215 220 

Glu Glu Pro Ala Ser Pro Pro Glu Arg Lys Thr Leu Pro Gin He Cys 
225 230 235 240 

cc Leu Leu s er Asn Pro His Ser Arg Phe Asn Leu Trp Gin Asp Leu Pro 

55 245 



30 



35 



45 



50 



250 255 

Glu He Arg Ser Ser Gly Val Leu Glu He Leu Gin Pro Glu Glu He 
260 265 270 

Lys Leu Gin Glu Ala Met Phe Glu Leu Val Thr Ser Glu Ala Ser Tyr 
275 280 285 ' 

Tyr Lys Ser Leu Asn Leu Leu Val Ser His Phe Met Glu Asn Glu Arg 
290 295 300 

He Arg Lys He Leu His Pro Ser Glu Ala His He Leu Phe Ser Asn 
305 310 315 320 

- n Val Leu As P Vai Leu Ala Val Ser Glu Arg Phe Leu Leu Glu Leu Glu 

70 325 330 335 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



His Arg Met Glu Glu Asn He Val lie Ser Asp Val Cys Asp lie Val 
340 345 350 

Tyr Arg Tyr Ala Ala Asp His Phe Ser Val Tyr He Thr Tyr Val Ser 
3" 360 365 

Asn Gin Thr Tyr Gin Glu Arg Thr Tyr Lys Gin Leu Leu Gin Glu Lys 
370 375 38Q 

Ala Ala Phe Arg Glu Leu lie Ala Gin Leu Glu Leu Asp Pro Lys Cys 



395 



400 



Arg Gly Leu Pro Phe Ser Ser Phe Leu He Leu Pro Phe Gin Arg He 
405 410 415 

Thr Arg Leu Lys Leu Leu Val Gin Asn He Leu Lys Arg Val Glu Glu 
420 425 43 0 

Arg Ser Glu Arg Glu Cys Thr Ala Leu Asp Ala His Lys Glu Leu Glu 
435 440 445 

Met Val Val Lys Ala Cys Asn Glu Gly Val Arg Lys Met Ser Arg Thr 
4S0 455 460 

Glu Gin Met lie Ser He Gin Lys Lys Met Glu Phe Lys He Lys Ser 



470 



475 



480 



Val Pro He He Ser His Ser Arg Trp Leu Leu Lys Gin Gly Glu Leu 
485 490 



495 



Gin Gin Met Ser Gly Pro Lys Thr Ser Arg Thr Leu Arg Thr Lys Lys 



505 



510 



Leu Phe His Glu He Tyr Leu Phe Leu Phe Asn Asp Leu Leu Val He 
515 520 525 

Cys Arg Gin He Pro Gly Asp Lys Tyr Gin Val Phe Asp Ser Ala Pro 



535 



540 



Arg Gly Leu Leu Arg Val Glu Glu Leu Glu Asp Gin Gly Gin Thr Leu 



550 



5S5 5 60 
Ala Asn Val Phe He Leu Arg Leu Leu Glu Asn Ala Asp Asp Arg Glu 

b 6 5 



570 



575 



Ala Thr Tyr Met Leu Lys Ala Ser Ser Gin Ser Glu Met Lys Arg Trp 



585 



590 



Met Thr Ser Leu Ala Pro Asn Arg Arg Thr Lys Phe Val Ser Phe Thr 
595 600 605 

Ser Arg Leu Leu Asp Cys Pro Gin Val Gin Cys Val His Pro Tyr Val 



615 



620 



Ala Gin Gin Pro Asp Glu Leu Thr Leu Glu Leu Ala Asp lie Leu Asn 



630 



635 



640 



He. Leu Asp Lys Thr Asp Asp Gly Trp lie Phe Gly Glu Arg Leu His 
645 650 655 

Asp Gin Glu Arg Gly Trp Phe Pro Ser Ser Met Thr Glu Glu lie Leu 
660 665 6 7o 

Asn Pro Lys lie Arg Ser Gin Asn Leu Lys Glu Cys Phe Arg Val His 
675 680 685 

Lys Met Asp Asp Pro Gin Arg Ser Gin Asn Lys Asp Arg Arg Lys Leu 
oyu 695 700 

Gly Ser Arg Asn Arg Gin * 
705 710 
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What is claimed : 

1. An isolated Rac-GEF polypeptide or a biologically-active fragment 
thereof. 

2. An isolated Rac-GEF, or a biologically-active fragment thereof, of claim 
1, wherein said polypeptide has a guanine nucleotide exchange activity, 
a specific binding affinity for a guanine nucleotide depleted Rac, or a 
cellular oncogenic transforming activity. 

3. An isolated Rac-GEF or a biologically-active fragment thereof of claim 
1 which is of human. 

4. An isolated Rac-GEF of claim 1 comprising amino acid 1 to amino acid 
711, as set forth in Fig. 1 (SEQ. ID NO: 2). 

5. An isolated biologically-active fragment of Rac-GEF of claim 4 which 
comprises amino acids 273-605. 

6. An isolated Rac-GEF, or a biologically-active fragment thereof, of claim 
15 1, which is substantially purified. 

7. An isolated nucleic acid comprising a nucleotide sequence coding for a 
Rac-GEF polypeptide. 

8. An isolated nucleic acid of claim 7, wherein said coded for polypeptide 
has a guanine nucleotide exchange activity, a specific binding affinity 

20 for a guanine nucleotide depleted Rac, or a cellular oncogenic 

transforming activity. 

9. An isolated nucleic acid of claim 7 which is human. 

10. An isolated nucleic acid of claim 7, wherein the nucleic acid sequence 
codes for amino acid 1 to amino acid 711, as set forth in Fig. 1 (SEQ. ID 

25 NO: 2). 

11. An isolated nucleic acid of claim 7, wherein the nucleotide sequence is 
operably linked to an expression control sequence. 

12. An isolated nucleic acid of claim 7, wherein the nucleic acid comprises a 
naturally-occurring nucleotide sequence. 

30 13. An isolated nucleic acid of claim 7, wherein the nucleic acid codes for 

said polypeptide without interruption. 

45 
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♦ 

An isolated nucleic acid of claim 7, wherein the nucleic acid is DNA or 
RNA. 

An isolated nucleic acid of claim 7, wherein the nucleic acid further 
comprises a detectable label. 

An isolated nucleic acid of claim 7, except where one or more amino 
acid positions are substituted or deleted, or both, and the polypeptide 
coded for by the nucleic acid is biologically-active. 
An isolated nucleic acid of claim 16, wherein the biological activity is a 
guanine nucleotide exchange activity, a specific binding affinity for a 
guanine nucleotide depleted G-protein, or a cellular oncogenic 
transforming activity. 

An isolated nucleic acid of claim 16, wherein the one or more 
substituted amino acid positions are substituted by conservative amino 
acids. 

An isolated nucleic acid of claim 16, wherein the one or more 
substituted amino acid positions is in the Dbl homology domain or the 
pleckstrin homology domain. 

An isolated nucleic acid comprising a nucleotide sequence which 
hybridizes, or whose nucleic acid complement hybridizes, under 
stringent conditions to base pairs of nucleotide sequence 900-1482 as set 
forth in Fig. 1 (SEQ. ID NO: 1). 

An isolated nucleic acid claim 20 comprising at least 95% nucleotide 
sequence identity to the nucleotide sequence set forth in claim 20. 
An isolated nucleic acid of claim 20, wherein said nucleic acid codes for 
a polypeptide having a guanine nucleotide exchange activity, a specific 
binding affinity for a guanine nucleotide depleted Rac, or a cellular 
oncogenic transforming activity. 

An isolated nucleic acid comprising a nucleotide sequence which is 
unique to Rac-GEF. 
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24. An isolated nucleic acid comprising a nucleotide sequence which 
hybridizes, or whose nucleic acid complement hybridizes, under 
stringent conditions to the unique nucleotide sequence of claim 23. 

25. An isolated nucleic acid of claim 24 which codes for a polypeptide 
5 having a guanine nucleotide exchange activity, a specific binding 

affinity for a guanine nucleotide depleted Rac, or a cellular oncogenic 
transforming activity. 

26. A method of expressing in transformed host cells, a Rac-GEF 
polypeptide coded for by a nucleic acid, comprising culturing 

10 transformed host cells containing a nucleic acid according to claim 7 

under conditions effective to express the polypeptide. 

27. A method of expressing, in transformed host cells, a polypeptide coded 
for by a nucleic acid, comprising culturing transformed host cells 
containing a nucleic acid according to claim 20 under conditions 

15 effective to express the polypeptide. 

28. A method of claim 26, further comprising isolating the polypeptide. 

29. A method of claim 26, further comprising modulating expression of the 
polypeptide. 

30. An isolated polypeptide produced by a method of claim 26. 
20 31. An isolated polypeptide produced by a method of claim 27. 

32. A transformed host cell containing a nucleic acid of claim 7. 

33. A transformed host cell containing a nucleic acid of claim 20. 

34. A vector comprising a nucleic acid of claim 7. 

35. A vector comprising a nucleic acid of claim 20. 

25 36. A method of modulating an activity of a Rac polypeptide comprising, 

administering an effective amount of a Rac-GEF polypeptide or a 
biologically-active fragment thereof, or an effective amount of a 
compound which modulates the activity of the Rac-GEF. 
37. A method of claim 36, wherein the Rac-GEF, or biologically-active 

30 fragment thereof, comprises an amino acid sequence which has a 
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specific binding activity for a guanine nucleotide depleted state of said 
Rac. 

38. A method of modulating an activity of a Rac polypeptide comprising; 

introducing a nucleic acid of claim 21 into said cell under 
5 conditions whereby said nucleic acid is expressed in an effective 

amount to modulate said activity of Rac in said cell. 

39. A method of claim 38 wherein said nucleic acid oncogenically 
transforms said cell. 

40. A method of isolating a molecule that binds to a guanine nucleotide- 
10 depleted state of a Rac polypeptide comprising; 

contacting a Rac polypeptide with a medium comprising said 
molecule under conditions effective for said molecule to bind to said 
Rac polypeptide; and 

separating said Rac polypeptide to which said molecule has 
15 bound from said medium. 

41. A method of claim 40, wherein said molecule is Rac-GEF. 

42. A method of claim 40, wherein said molecule has a molecular weight of 
about 82.5 kilodaltons. 

43. A method of claim 40, further comprising separating said molecule 
20 from said Rac polypeptide. 

44. A method of modulating an activity of a GTPase comprising, 
administering an effective amount of a guanine nucleotide exchange 
factor or a biologically-active fragment thereof, or an effective amount 
of a compound which modulates the activity of the guanine nucleotide 

25 exchange factor. 

45. A method of claim 44, wherein the guanine nucleotide exchange factor, 
or biologically-active fragment thereof, comprises an amino acid 
sequence which has a specific binding activity for a guanine nucleotide 
depleted state of said GTPase. 
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A method of testing for an agent which modulates the guanine 
nucleotide exchange activity of a guanine nucleotide exchange factor 
comprising: 

contacting a mixture of (a) a polypeptide comprising a guanine 
nucleotide exchange factor, or a biologically-active fragment thereof, 
and (b) a polypeptide comprising a GTPase, or a biologically-active 
fragment thereof, to which the exchange factor can bind, with an agent; 
and 

assaying for the presence or amount of guanine nucleotide 
exchange activity in the presence or absence of a GEF enhancer. 
A method of claim 46, wherein the GTPase is Rac. 
A method of claim 47, wherein the guanine nucleotide exchange factor 
is Rac GEF and paid GEF enhancer is ascorbyl stearate. 
An agent identified by the method of claim 48. 

A method of testing for an agent which modulates the binding between 
a guanine nucleotide exchange factor and a GTPase comprising: 

contacting a mixture of (a) a polypeptide comprising a 
guanine nucleotide exchange factor, or a biologically-active fragment 
thereof, and (b) a polypeptide comprising a GTPase, or a biologically- 
active fragment thereof, to which the exchange factor can bind, with an 
agent; and 

detecting the presence or amount of binding between the 
guanine nucleotide exchange factor polypeptide, or the biologically- 
active fragment thereof, and the GTPase. 
A method of claim 50, wherein the GTPase is Rac. 
A method of claim 50, wherein the guanine nucleotide exchange factor 
is Rac-GEF. 

An isolated agent identified by the method of claim 50. 

An isolated antibody which is specific for a Rac-GEF or a peptide 

comprising a sequence present therein. 
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An isolated antibody of claim 54, which binds to an amino acid 
sequence selected from the group consisting of 

H.NAFRELIAOLELDPKCOOH 

H 2 NYQERTYKLPFSSFLCOOH 

H : NPQRSQNKDRRKLGSRNRQCOOH 
A method of increasing the guanine nucleotide exchange activity of a 
guanine nucleotide exchange factor, or a biologically-active fragment 
thereof, said factor capable of acting on a member of the Ras 
superfamily of GTPases, comprising the steps of: 

contacting said guanine nucleotide exchange factor, or a 
biologically-active fragment thereof, with said member of the Ras 
superfamily of GTPases, or a biologically-active fragment thereof; and 

assaying for guanine nucleotide exchange activity under 
appropriate conditions in the presence of a guanine nucleotide 
exchange factor enhancer. 

A method as described in claim 56 wherein said member of the Ras 
superfamily of GTPases is Rac and said guanine nucleotide exchange 
factor is Tiam-1. 

A method as described in claim 57 wherein said GEF enhancer is 
selected from the group consisting of ascorbyl stearate, ascorbyl 
palmitate, and phosphoinositol. 

A method as described in claim 58 wherein said phosphoinositol is 
selected from the group consisting of PI3,4,5P 3 , PI4,5P 2 and PI4P. 
A method of assaying for a compound to treat disease resulting from 
increased guanine nucleotide exchange activity of a guanine nucleotide 
exchange factor, or a biologically-active fragment thereof, said factor 
acting on a member of the Ras superfamily of GTPases, comprising the 
steps of: 

contacting said guanine nucleotide exchange factor, or a 
biologically-active fragment thereof, with said member of the Ras 
superfamily of GTPases, or a biologically-active fragment thereof; 
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assaying for guanine nucleotide exchange activity under 
appropriate conditions in the presence of a guanine nucleotide 
exchange factor enhancer, and in the presence and absence of said 
compound; and 



5 determining if said compound decreases said guanine nucleotide 

exchange activity. 

61. A method as described in claim 60 wherein said member of the Ras 
superfamily of GTPases is Rac 

62. A method as described in claim 61 wherein said guanine nucleotide 
10 exchange factor is Tiam-1. 

63. A method as described in claim 62 wherein said GEF enhancer is 
selected from the group consisting of ascorbyl stearate, ascorbyl 
palmitate, and phosphoinositol. 

64. A method as described in claim 63 wherein said phosphoinositol is 
15 selected from the group consisting of PI3,4,5P 3 , PI4,5P 2 and PI4P. 

65. Compounds of claim 60 that decrease said guanine nucleotide exchange 
activity. 

66. Ligands that bind to the Src homology 3 domain on Rac-GEF. 

67. Ligands that bind to the Src homology 3 domain on Rac-GEF identified 
20 by the methods of claims 46 or 50. 
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Liver Rac GEF Map (1 > 31 71 ) Site and Sequence 

Enzymes : All 440 enzymes (No Filter) 

Settings: Linear, Certain Sites Only, Standard Genetic Code 



:ACTCAAACCAGTGAAGCT-2GGAAAGTCATTGACCTCCA37CG'TCTGC-3AGAAACATCTGGCTCTATTTCC 
■ 1 ^ ■ ' ■ i . , ■ , . . 75 

'ggagaccagggaatctgaagatttggaaaagacccggaggaaatcagcaagtgatcaatggaacactgataat 

1 ' 1 1 1 ■ ■ ' ■ ' «- 150 



■a.a. 1-1271 



'Liver Specific Sequence- 



rs: Giu Thr Arg Giu Ser G!u Asd Leu Giu Lys Thr Arg Arg Lys Ser Ala Ser Asd Gin Tro Asn Thr Asd Asn 

gaacc agccaaggtgaaacc"3agttactcc:agaaaaagaggagacttct:aagc7gaccaggatatccaagac 

■ 1 1 1 ' " 1 ■ ' ■ ' - 225 



•a.a. 1-127* 



-Liver Specific Sequence - 



ro A:a Lys Vc Lys pr C 3.u Lea Leu P-o Giu Lys 3.u G.u >r Ser 3 n A-a Asd G!n Asd He 3!n Asd 

AGC:':AT"3::ACA^:r:A^"T AAGAGAAA"CCA':"":AATCGC"::ATAAGACGCAAAAGCAAAGC: 

J " ' *~ ' ■ - J ' 1 L 300 



•a.a. 1-1271 



•Liver Specific Sequence - 



.5 ou r-c H:s Lys -JS lie !le Lv5 Arg Asr 5er lie Pre Asn Arg Ser lie Arg Arg Lys Ser lvs Aiq 

-33c:a3A3acaa::ccgaa:33aacgc:agc:3::tg3ca3a":acag3acaa7ggaaaatctgtaaa7gag 

■ i . . . ' ' ■ « 375 



■a.a. 1-1271 



• Liver Specific Sequence ■ 



_/3 Aia Arg Asd Asn 2 ro Giu Arg Asn A;a Ser Cys Leu A>a Asd Ser Gin Aso Asn Gly Lys Ser Vai Asn Giu 

:::ctgaccttgaa:atccc:"3gagcagaa7gcct::7"gcagaacagcaatgcagacagacccaggagcccag 

1 > . . 1— « , 1 ■ . , l 



•a.a. 1-127" 



•Liver Specific Sequence - 



--a Leu 7-r Leu Asn He Pro 7rp Ser Arg Met Pro Pro Cys Arg 7h,r Ala y et Gin 7hr Asd Pro G!y Aia Gin 

jAAA7GAGTGAGTC37CCTCCACC3C3GGAAATGGGGCCACGCCCGAGGAG"GGCCGGCCCT3GCCGAC AGCCCC 
. . . . , , , _ , , , . 525 

- 1 

7.u Met Ser G!u Ser Ser Ser 7nr =>ro Gly Asn Giy Aia 7r- D ro Giu Giu >d Pro Ala Leu Ala Asd Ser Pro 
aCCACGC'CACCGAGGCCCTGCGGATGATCCACCCCATTCCCGCCGAC TCC'GGAGAAACCTCATTGAACAAATA 

■ i — « . 1 , , , , , , , L 

~— 7hr Leu 7hr Giu Ala Leu Arg !"et lie His Pro ile Pro Ala Asp Ser >d Arg Asn Leu ile Giu Gin lie 

3GGCTCCTGTATCAGGAATACCGAGATAAATCGACTCTCCAAGAAATCGAAACCAGGAGGCAACAGGATGCAGAA 
. 1 — , . 1 . «- , l- i 1 , 1 u 675 

S v Leu Leu 7yr Gin Giu 7yr Arg Asp Lys Ser Thr Leu Gin Giu He Giu "nr Arg Arg Gin Gin Asa Ala Giu 

-7AGAAGACAATAC3AATGGG7CCCCGGCCAGTGAGGACACCCCGGAGGAGGAAGAAGAAGAGGAGGAGGAGGAG 
. , i , . . , — — , , , , , , l 75 0 



3 a.a. 216-226* 



Poiy-Glutamale Region 

lie Giu Asd Asn Th- Asn Giy Ser Pro Ala Ser G-u Asp Thr Pro Giu Giu G!u Giu Giu Giu Giu Giu Giu Giu 

3AGCCGGCCAGCCCACCAGAGAGGAAGACTCTGCCCCAGATCTGCCTGCTCAGTAACCCCCACTCAAGGTTCAAC 
. 1 ■ 1 ■ ' . i , 1 , . , 1 . 8 25 

'alhaa 227-231 H 
■Po*- SHS -Binding Site 
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G, Pro .o Ser Pro Pro Giu *, Lys ^^^^Z^^^^ qQ0 
^'^TGGCASGATCTTCCCGAGATC CGGAGCAGCGGGG i GC i jAoA . ^.CTA>.«Gv- b, , 900 

I ■ 

- r .» Leo *o * *~> » "» « ile l " °" ? ™ C ' U °" '** G '" 



■ a a 273-456" 



- Obi Homology Domain- 



« « r. » u- . - *■ *, ^ v v.* J- JS^^T. 

3 - 5 AACGA3CGGATAASSAAG^5£2CA^£3 - ^ 1 , Q50 



is- 



— a. a. 273-456 — - 
• Dbl Homology Domain - 



L o-j p^e ^sr* 



Asr. vo ueu Aso vq- Leu 



-3 w ~ J " 



« - - - - ° ° 



z'j*-*----**- - J -* r , --'25 



> a*. 271-456' 



Dbl Homology Domain - 



-s-Arq^- 3- -u Asn ile Vai ile Ser Asd Vo! Cys Asp 



>aa. 273-456' 



- Dbl Homology Domain - 



- ai Ai Acn h-« Phe Ser Vol Tyr ^ie >r"7yr Vol Ser Asn Gin Thr Tyr Gin Glu Arg 
!!e Vo. Tyr Arg ,yr A,o Ala Asp _ ^ T^CCGGGAGC TfiATCGCGC AGCTAGAGCTCGACCCCAAGTGC ^ 



ACCTATAAGCAGCTGCTCCAGSAGAAGGCAGC^ 



■ a.a. 273-^56" 



• Obi Horroiogy Dorr.a:n- 



~~r Tyr LyS Gin Leu Leu Gin Glu 



, iu Ly s MO A,o Pre Arg G.u Leu lie Alo Gin Leu Glu Leu Asp Pro Lys Cys 
a.a. 379-402 
Dbl Insert Region - 



ar;r.Kr,SCTGCCC7TCTCCTCC7TCCTCATCC-5^ 



r.TTTCCAG AGGATC ACACGCC T C AAGCTGTTGGTCCAGAAC ^ ^ 



'a.a. 273-J56' 



■Dbi Homology ucmam — - 

, Phe G:r. Arg ile Thr Arg Leu Lys Leu Leu Vol Gin Asn 



4rg Gly Leo Pro Phe Ser Ser Phe Leu He Leo Pro I 
• 1 

— .|>.>->ri-r'PTiT »^---TTTp,ftftTGCTCACAAGGAGCTGGAAATGGTG ^ 




Obi Hcnr.oicay Domain- 



FIGURE lb 



SUBSTITUTE SHEET (RULE 26) 



WO 98/57990 



PCT7US98/1239! 



. 3/9 

■»a.a. 273-455^— f 

• Dbl Homology Domain J 

Voi Lys Ala Cys Asn Glu Gly Vol Arg Lys Met Ser Arg Thr G.'u Gin Met ile Ser lie Gr Lys Lys Met Glu 

t tcaagatcaagtcggtgcccatcatctcccac:cccgctggc:gc:gaagcagggtgagc-:agcagatgtca 



' Pie.ck^«s Honnology Domain • 



G!y Pro Lys Thr Ser Arg Thr Leu Arg T hr Lvs Lys Leu Phe h, s Gu ile Tyr Leu Phe Lej Pre Asn Asd Leu 
L jo G A « oCloGv-AG^ ! — AGGAG^_~~3-ACCAGGTAT~ T GA3^AGC~2C3w3GGGAC~3CTGCGTG7G 



?575 



Pieck&Vno HOmoto^y Dome in 

Phe Lvs !le LyS Ser vol Pro Ue lie Ser His Ser Arg Trp Leu Leu Lys Gin G>.y Glu Leu Gin Gin Met Ser 
SGCC CC AAGACCTCC C3GACC C'GAGGACCAAGA A GCTCTTCCACG A AATTTACCTCTTCC7GTTC A AC GACCTG 



1650 



'725 



. -»r a 



,r j-p 



J o O O M „ 



. - J _ o . 



J J — - - 



?r - -3 A-g 3 * _e j {_ej A 



:soo 



G:u G:u Leu Glu Aso G:n Giy 3-^ rhr Leu A a Asn Voi P*.e lie Leu Arg Leu Leu Giu Asn A:a Asp Asd Arg 
GAGGCCAC:taCATGCTAAAGG:3TCC'CTCAGAGTGAGATGAAGC3TTGGATGACCTCACTG3CCCCCAACAGG 

. . , _U- , I , , , . , . . 1875 



Giu Ala Thr Tyr Met Leu Lys A!a Ser Ser Gin Ser Glu Met LyS Arg Tro Met Thr Ser Leu Ala Pro Asn Arg 
AGGACCAAGTTTGTTTCGTTCACATCCCGGCTGCTGGACTGCCCCCAGGTCCAGTGCGTGCACCCATACGTGGCT 



1950 



■a.a. 608-680" 



1 Src Homology 3 Domain 

Arg Thr Lys Phe Vol Ser Phe Thr Ser Arg Leu Leu Asp Cys Pro Gin Vol Gin Cys Vol H;s Pro Tyr Vol Ala 
CAGCAGCCAGACGAGCTGACGCTGGAGCTCGCCGACATCCTCAACATCCTGGACAAGACTGACGACGGGTGGATC 



2025 



■a.a. 608-680 1 



•Src Homology 3 Domain- 



Gin Gin Pro Asp Glu Leu Thr Leu Glu Leu Aia Asd Ile Leu Asn ile Leu Asd Lys Thr Asd Asp Gly Trp Ile 

tttggcgagcgtctgcacgaccaggagagaggctggttccccagct:catgactgaggagatcttgaatc:caag 



2100 



"a.a. 608-680" 



- Src Homology 3 Domain - 



Phe Giy G'u Arg Leu nis Asd Gin Glu Arg Giy Tro Phe Pro Ser Ser Met Thr Giu Glu lie Leu Asn Pro Lys 

atccggtcccagaacctcaaggaatgtttccgtgtccacaagatggatgaccctcagcgcagccagaacaaggac 

, , , ■ , , , . , 1 . 2175 

■—a.a. 608-680 "H 
• Src Homoiogy 3 D * 

ile Arg Ser Gin Asn Leu Lys Glu Cys Phe Arg Vol His Lys Met Asp Asd Pro Gin Arg Ser Gin Asn Lys Asp 

CGCAGGAAGCTGGGCAGCCGGAATCGGCAATGACCCCCACCCAGGGGGCCAGCGGGAGCAGGGCCTGCATGAGAC 
, , l— — ... i , 1 , 1 , 1 . . , l 2250 
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Brain-Specific Sequence Map (1 > 198) Site and Sequence 
Enzymes : All 440 enzymes (No Filter) 

Settings: Linear, Certain Sites Only, Standard Genetic Code 

3AATTCCCGCAGCCCGT7AGTC3CCCCCGACCCAGCCCAGGGCCCCGGCGTGGCCCCAGACCCGGCCCCAGCACC 
i . > ■ ■ ' ■ ■ ' ■ 1 — 1 ■ — - 75 

r Brain Specific Sequence— 

|m — — — — — — — ia.a. 1-66— 

Glu Phe Pro G :r% - Va: Ser Arg Arc 3 *-c Ser ? ro Gv Pro Arg Arg Giy Pro Arg Pro Gly Pro Ser Th- 



CGCCCCGCCGC AGACCCTAT3GAGC T3CTGGCC jCTGCC""CAGCGCCGCCTGC3CCGTGGACCACGACAGTTCC 
, . ■ . . . ■ . ■ ■ ■ - »— ■ u 150 

■ — —Brain Specific Sequence — — 

— i^^m^^™ a .a. 1 -66 ™ 

Arg Pro Ala Ala Asd Pro Met Giu Leu Leu Ala Ala Ala Phe Ser Ala Ala Cys Ala Vol Asp His Asp Ser Ser 
Z i . -j ■ 1 1 1 ■ — ' 1 ' L 

ACCTCGGAAAGCGACGCGCGCGAC TCGGCGGCGGGACACCTGCCCGGC 
, , , , , . , . - » 198 

Brain Specific Sequence j 

———a.a. 1-66 I 
Thr Ser Glu Ser Asp Ala Arg Asp Ser Ala Ala Gly His Leu Pro Gly 
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a.ii.'4() - aalSVI 



Figure 3 



SUBSTITUTE SHEET (RULE 26) 



WO 98/57990 



PCT/US98/12391 



. 7/9 
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